PHYSICS 330: Quantum Field Theory | 


Lecturer: Professor Bernhard Mistlberger 
Notes by: Andrew Lin 


Autumn 2022 


Introduction 


Professor Mistlberger can be reached at bernhard@slac.stanford.edu, and the TA (Kevin Zhou) can be reached at 
knzhou@stanford.edu. This course will have two lectures a week (on Tuesdays and Thursdays) from 9-10:20am, and 
we may attend optional section classes Thursdays from 12-1:20pm (for further discussion on homework problems and 
material) in GESB 150. 

Homework will be assigned weekly, going over examples that are related to lecture and helping us keep up to date 
with the mathematics. They'll be due on Tuesdays at midnight, and the first one will be released this afternoon. 
(Grades will be based on the best eight of our ten homework grades, as well as a final take-home exam exercise.) For 
any questions, we can attend Professor Mistlberger’s office hours from 5-6pm on Mondays on Zoom. 

Since the students here have different academic backgrounds, we'll start off relatively slowly and encourage ques- 
tions. Our goal is to cover the first seven chapters of Peskin and Schroeder's “An Introduction to Quantum Field 
Theory,” hopefully making us familiar with the ideas and basic structures in the subject. (There is lots of material 
that we can find in other books too — some good sources are Srednicki’s “Quantum Field Theory,” Itzykson and Zu- 
ber’s “Quantum Field Theory,” Zee’s “Quantum Field Theory in a nutshell,” and Weinberg’s “The Quantum Theory 
of Fields.” The last of these texts has lots of material and is written by a giant in the field, but is potentially hard to 


learn from on a first pass.) 
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Today’s lecture will mostly be a “philosophy lesson,” thinking about the general preamble to QFT and the question 
of “why.” When the field first started, the goal was to unify quantum mechanics and special relativity, but the 
problem is that quantum mechanics works with small scales (such as atoms) and special relativity with fast speeds 
(near the speed of light). The field of (high-energy) particle physics is a natural place where these areas interact, and 
that’s where everything started, but QFT can be applied today to condensed matter physics (crystal excitations and 


interactions), cosmology (the Big Bang, inflation, correlation), gravitational wave physics. 


+ One consequence of Einstein's E = mc? is that if we put a certain amount of energy into a box and shake it 
hard enough, basically anything can come out of it. Quantum mechanics lacks a way to explain how this process 


actually occurs, and facilitating the creation of new particles and antiparticles can be answered by QFT. 


+ The problem of causality (one of the ideas of relativity) will also come into play — if we consider the correlation 


function U(t) = (x|e7!"*| x9) for the free Hamiltonian H = ins we can insert a complete set of states and 


calculate 


if RS —im(x—xo)?/(2t) 
p) (p|x0) = (sm) ? 


U(t) =f oe (xertint 


and the point is that we'll always have some probability density to carry x to xo in a fixed time t, which Is bad 
because particles are only supposed to be able to travel at (less than or equal to) the speed of light. And even 
if we use a relativistic version of this with E = ./p2 + m2, that still doesn’t solve the problem for us! So a 


different framework is indeed needed, and QFT does this in a weird but elegant way. 


+ In QFT, we are postulating that everything in the universe (except gravity) can be described with just a set of 
fields. So there's a level of reductionism here — we're saying the fields are spanning the universe and the rules are 
the same everywhere. However, while there's been trouble making observations and measurements to actually 
probe the relevant scales, there doesn’t seem to be any kind of modification needed to the theory like in classical 
mechanics (except gravity). But that doesn’t mean we should be studying subjects like chemistry with QFT — 


it's not very tractable. 


Fact 1 
We'll be using natural units in this class — there’s only a few actual numbers that the universe throws at us, such 


as the cosmological constant A ~ 107-52 m~?, Newton's constant Gy © 6.7 x 1071!m?/kg-s, the speed of light 


c=3x 108 m/s, Planck’s constant f = 10734J -s, and the Higgs mass mp & 125 GeV. We won't think much 


about the first two of these, but we will use units where fh = c = 1 (because for the purposes we're dealing with, 


this is the natural sense of scale). And we can use dimensional analysis to recover usual units if we need. 


We'll use square brackets to denote dimension in mass, so [m"] = n means the dimension of mass in that quantity 
is n. So we're setting [fA] = [c] = 0 and [m,] = 1, and following through the calculations yields [s] = [L] = —1 
(so 1 second or 1 meter has units of inverse mass in natural units). Our energy unit will be the electron volt 
(1 &V © 1.6 x 10719 kg - m?/s?, which is the same as 1.783 kg c? - 10-96 = 5.1 x 10°£. So we can switch back and 
forth between kilograms, meters, and seconds by just introducing the necessary fis and cs, and we'll drop the additional 
fs and cs from here. And this is convenient because the proton mass is about 938 MeV (so around 1 GeV), while the 
mass of the electron is 511 keV and the mass of the Higgs boson is about 125 GeV. Energy units should generally be 


thought of as corresponding to length scales — for example, it’s useful to keep in mind that oS = 200 MeV. 


Remark 2. Tossing fi and c is done because in most problems we discuss, they don't add much to the calculation. But 
keeping mass as a relevant quantity will be important, and we'll see that moving forward. For example, we find that 
the charge radius of a proton is OGY: which is comparable to its mass. (And we should remember that mass and 
energy are basically equivalent.) However, we should make sure this correspondence is only thought about in terms of 


elementary particles rather than composite structures. 


Example 3 


As a naive example, light with a wavelength of 600 nm translates to an energy of roughly 2 eV (using F = hy = fey, 


so we can resolve structures at a 600 nm length scale given a particular microscope of that energy. Electron 
microscopes have an energy of 511 keV, corresponding to a length scale of 2.5 x 10712 m — beyond the rest 
mass, we start getting in regimes where everything becomes quantum mechanical. (The best we can actually do 


is 50 x 1071? m with current technology.) 


Our next point of discussion is the question “what are fields?” For example, temperature is a scalar field, where 


there is a temperature value at every point in spacetime. (And the electric field is similar but is instead a vector field.) 


The subsequent questions are then where those fields comes from and what they’re made of, and we'll answer those 
in due time. We usually take think of coordinates (x, t) as operators (X, f) in quantum mechanics, but that won’t be 
done in quantum field theory — t and x will actually just serve as coordinates, and instead the fields $(x, t) will be 
what are promoted to operators dh instead. So everything acting on wavefunctions will be a field operator, and once we 
learn how to do quantization of fields we'll start to see expressions like w(x, t)|Q). (And the reason that quantizing 
x and t doesn’t work is that when we try it, we run into negative probabilities, lack of ground states, and so on. But 
we can read Srednicki for more on this.) 

Our intuition might be that each point in spacetime influences its “local” neighbors, and to reach other points we 
must propagate through spacetime. That's what will ultimately be necessary for being compatible with relativity, and 
we need a way to describe time-evolution. In classical quantum mechanics, we have this idea that there is a differential 
equation 


100 |b) = H(g, 06) |v), 


where we introduce canonical coordinates and their derivatives (momenta). To make sure that our theory is indeed 
local, it’s important that our evolution operator is only being evaluated at a single point, and then Lorentz invariance 
requires us to have as many time derivatives as spatial derivatives. But we still haven't discussed what these fields are 
made of — at the end of the day, the idea is that we put a harmonic oscillator at every spacetime point, and they 
interact with their neighbors. We then run into issues with infinities everywhere, and the work that we'll be doing is to 
remedy this in a clever way. And one way to make sure this is “relativistically fine” is to describe everything with wave 


equations: 


Definition 4 


The Klein-Gordon equation is the differential equation ot m?o = 0, where we are using four-indices: we have 


Or eon ae = 2 2 where we use the metric nev’ = diag(1,-—1,-1,—1). (If we are using three-indices 


Oxe Bx; OXF! 


instead, we'll use the Euclidean metric.) 


To understand where the Klein-Gordon equation comes from, we can start by imagining this problem in two 
dimensions. Suppose we have a string stretched from 0 to xo with a vertical displacement f(x) at point x; the force 


equation due to tension is then 


_ df(x+dx)  df(x)] _ d?f 
Pant dx dx | | Vag 
and by Newton's second law this is the same as 
en et 
IT pe PON 2 


Collecting terms, this gives us the familiar wave equation 


a a re Ls 
dt? dx? ’ p- 


(In other words, we get of = 0.) But if we now embed our string into a sheet of rubber (which restores the string 


back) and get an additional F = —Y dxf (x, t) force (where Y is Young's modulus), we end up instead with the equation 
oof + m?’f = 0, where m= /%- So this equation can indeed describe a classical system that we're used to, but we'll 


describe relativistic particle mechanics with it going forward and we'll discuss that next time. 
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Last lecture, we ended by discussing the relativistic Klein-Gordon wave equation — it'll play an important role in quantum 
field theory, and we saw a classical example (of a string embedded in rubber) in which that wave equation naturally 


arises. The equation looks like 


fe) fo) fo) 6) 
e+m)o=0, B= 
(0, Me BOP O@ Ox Ox 
(in which we've already set natural units to suppress the factor of c? in front of the spatial derivatives). Today, we'll 
use a classical system to see what we'll be doing for a good amount of the rest of the course. This should give us some 


intuition, and it should only be followed somewhat heuristically to give us the general philosophy of the approach. 


Example 5 
Consider a one-dimensional crystal consisting of a line of atoms each with some vertical displacement, and let ¢, 
be the displacement of the nth atom. There will be some kind of correlative restorative force between the atoms, 


as well as an overall restoring force (like the “rubber” from last time). This system has Hamiltonian 


co 


1 il 1 
H = Ekin + V = Y) 5(80bn)? + 5 (bn — On4s)? + 5 POF 


n=—oco 


(Here 09 = 0, and the second term will look more like a spatial derivative if we shrink the distance between the 


atoms and rescale appropriately. ) 


We'll impose commutation relations as we do in quantum mechanics, specifically 


[bn, on’ = [Oo¢n, Oobn’ = 0, Ibn, Oobn] = Onn = bnOon _ Oodbr dn. 


So @y is playing the role of g, and Oo@, Is playing the role of g. In other words, , “creates” displacement and Oodp 
“changes” it, and quantum mechanics is saying that these two operations do not commute. Then like in Lagrangian 
mechanics, we have the canonical momentum 

OL 
O(Oo¢n) 


(we're being sloppy about Lagrangian vs Lagrangian density; this will be clarified later) where 


Mn = Oobn = 


co 


L = Exin-V= S> tnd0bn — Hn. 


n=—oco 


This Lagrangian has the symmetry n> n+ 1, so it’s useful to do a Fourier decomposition: we write 


® dk ape "dk yo 
bn= fave iK), =f eRelhi(hy, 


where we make the particular choice that o!(k) = 6(—k) and Mt(k) = M(—k). This then tells us some information 


about commutation relations: 


[6(k), &(K’)] = [FI(A), ACK’)] = 0, (BCA), ACA) =F SO eK = on 5(k — K’), 


n=—oo 


as long as we make the assumption that @ and 7 are periodic. Putting this back into our Hamiltonian, we have 
—— ; ra 
H==— bl (k)F(k) + $'(k)[m? + 2(1 — cos k)]O(K)] . 
—T 


(We won't worry about the dimensions of m here — just treat it as a parameter.) But now if we set we = 
/m? + 2(1 — cos(k) and let 


1 on te 1 Fe 
ae [webe + IA(A)], ak = Atta [wbx — HN(A)| , 


we find that 
[ax, af] = 5(k—k’),  [a, a] = [at at] = 0 


so that our crystal Hamiltonian simplifies to 
1 as 
H= =f dkw[a(k)'a(k) + a(k)a!(k)] 
oar | 


which is the harmonic oscillator Hamiltonian in Fourier space. (This should all look similar to converting from x and p 
to creation and annihilation operators.) We can then check the commutators of H with a and al; if we have an energy 


eigenstate |F) of energy E, then 
H(ax |E)) = Eag |E) + [H, ax] |E) = (E — wx) (ax |E)), 


and similarly we find that 
H(a}. |E)) = (E + wx)(af |). 


So a, and al lower and raise the energy by some factor, and that allows us to create the spectrum by defining the 
vacuum state |0) satisfying a, |0) = 0 and then having |w,) = a. |0) as usual. And the way these energy eigenstates 
look is that we have the kth Fourier mode by collectively oscillating the particles in a sine shape — this gives rise to 
phonons on a crystal. So going away from displacements ¢@ and turning to collective “particles” w, will give us a sense 
of what quantum fields are doing — there, we'll promote fields #(X, t) to act on wavefunctions, where X and t will be 
the points in spacetime that tell us where oscillators are sitting. 

To put this all on more systematic footing, we'll now think about quantization. The two main ways to do this 
are canonical quantization and path integrals, and we'll discuss the latter only in QFT II. The former builds up the 
concepts in a more pedestrian way, but it’ll take more work and be less “natural.” Recall that the least action principle 


tells us that the motion taking us from to to ty is the one that minimizes the action 


S = [ tara [ otecve. du), 


and the difference here is that £ will be the Lagrangian density of fields (which we'll just call Lagrangian). So we want 


to solve 6S = 0 to get the classical trajectory; using the chain rule yields 


6S = [atx S00 + 3G, say .2)| 


=| atx | 5500 2 (san) on (aan)| 


(Here we're going to assume smoothness so that we can interchange infinitesimal changes and derivatives.) But 


if we demand that everything falls off to zero at the boundaries of the spacetime, the total derivative term at the end 


will always be zero, and what we're left with is the Euler-Lagrange equations 


a al aL 
Op" \ AA.) ] 


For example, with the Klein-Gordon field, we have 


OL 
op 


OL 
(0,6) 


me 
2 


£=3(0,0"-2e =-mi4, a. (sar) = 8.(0%8) = abe, 


so we get back (a7 —m?*)@ =0. And Lagrangians are useful because they allow us to see symmetries such as Lorentz 
invariance, and we'll go a bit through that to make sure we’re on the same page. 


¢ In ordinary two-dimensional space, we rotate a point by multiplying by a rotation matrix 


x'| _ |cos@ —sin@| |x 
y! sind cosé@ | ly] 
with the nice properties that R(y) = R(@)R(6) (the product of rotation matrices is a rotation matrix) and 


0 -1 
1 = R(@)R~1(@). And we can generate rotations out of infinitesimal movements — if r = : , | then 


R(@) = e°. 


In three-dimensional space, we have the space SO(3) of rotations generated by rotations around z, y, x: 


Oo —-l1 0 


x 
II 


0 0 
0 0 OO}, |]0 0 -I1 
0 1 0 


with the generators satisfying the Lie bracket [r’, /] = e¥* rk. 


Finally, the Poincare group consists of the transformations which are a Lorentz transformation A plus a translation 
a: 
x# ey xH = AML XY 4 gl 


where the (linear transformation) A“, is given by ox and where g°? = g#”/?,,A°,, so the metric (we're using 


the (+,—,—,—) metric) is left invariant under Lorentz transformations. Specifically, this includes translations 
(1, a), homogeneous transformations (A, 0), rotations (R, 0) (in which only the spatial coordinates are affected), 


and Lorentz boosts. Examples of the latter two look like 


1 0 0 0 cosh@  —sinh@ O O 
Ru, = 0 re —sin@ 0 LH, = —sinh@ cosh@ O O 
0 sin@ cos@ O 0 0 1 0 
0 0 0 1 0 0 0 1 


respectively. Furthermore, the Poincare group also includes spatial inversion (P, 0) (the diagonal matrix diag(1, —1, —1, —1)), 
as well as time reversal (7,0) (diag(—1,1,1,1)) and the composition of those two (PT,0). We call transfor- 

mations proper if det(A) = 1 and a = 0 and orthochronous if A°9 > 1 (so time reversals are not allowed); the 

only proper orthochronous transformations are the three rotations and the three boosts, and they can always be 
decomposed as 


My, = ae + wy, 


where w is “infinitesimal” in the sense that O(w*) = 0 is very small. Then plugging into the identity from before 


tells us that 
gy = Mpg’ Mo = gh + uhY + wth + O(w?), 


so we must have an antisymmetric tensor w"” = —w’" and thus our tensor looks like 
1 yyot 82 gj 
—yl = 12 13 
Ay W 1 W W 
aj? ajl2 i ws 
gos 4si8 “jae 1 


(notice that we have upper indices this time). 


Finally, thinking about how our fields transform, imagine we have a scalar field (so for example there is a “blob of 
temperature” in our field). Then we can notice that (x) 4 @$'(x’) = @(A~1(x’)) (so it looks like the field in the old 
coordinates is reached by the inverse Lorentz transformation); this is something we should try going through on our 


own. And if we have a vector field instead, the direction of the field also has to transform — we have 
AP Hy AM ( x’) = MAY (A7tx). 


(Things like spin are more complicated, and we'll talk about it later on.) 
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Last lecture, we looked at our first field theory (a classical field theory) and talked about it in terms of writing down an 
action S, leading us to the Euler-Lagrange equations which give us the equation of motion. Importantly, there will be 
an Euler-Lagrange equation for each field sitting in our space. In particular, the classic Klein-Gordon field Lagrangian 
(density) led us to the familiar equation (87 + m?) = 0. 

We mentioned that symmetries will play a big role in our discussion going forward, and one important feature of 
the equations we're discussing is that they're Lorentz-invariant. So last time, we also looked at the Poincaré group 


(Lorentz transformations plus a translation) and started thinking about how coordinates transform. 


Remark 6. /n general relativity, there is an effort to distinguish between active and passive Lorentz transformations, 


but we don’t need to be so careful here. The main point is that “the field still looks the same.” 


It turns out that Lagrangians help us figure out symmetries — for example, suppose we have a Lagrangian for a 
complex scalar field 
L= 0,90" — mo" 
and consider the transformation ¢ ++ e'%¢ (so that o* 1» e7'%g*) for some constant a. This keeps the Lagrangian 
constant (meaning that the overall phase of @ doesn't matter), and we'll see less trivial (but useful) examples moving 


forward — this leads us into Noether’s theorem, a useful way to establish symmetries in a system: 


Proposition 7 (Noether) 


Every continuous symmetry of the lagrangian £ gives rise to a conserved current j*(x), such that the equations 


of motion yield 0,4 = 0. 


Proof. Let $, be a set of fields, and suppose that we have an infinitesimal transformation $5(x) = 6; + €Ad,(x). 


Then 6¢, = €Adz, so by the chain rule we have 


OL 
OL =e ab, Ad,(x) + TG, i 


Applying the same trick as last time of separating out a total derivative, we can write this as 


OL OL OL 
— Ads Adz }). 
(5 a (555) ? +84 ( ses Onda) 6.) 


The first term here is zero by the usual Euler-Lagrange equations, and now if we have a symmetry then the corresponding 


FAA oO (Ada). 


transformation must satisfy 6£ = 0. This gives us 


OL _ —_ OL 
Ou, (sea fhe) =0 =) => 7 (0, 58,6.) 09? 


Notice that it’s also okay for the Lagrangian to change as a total derivative OL = 0,A" (this yields 6S = 0), so 


we can actually have 
OL 


O(Ouba) 


as long as AY keeps the action constant. For example, translations in the Poincaré group correspond to four symmetries 


fs eda 


and thus four conserved currents. 


We also have corresponding conserved charge Q = Pax), and notice that this means we have 
0=HQ= / xO Pix == / d?xdjj'(x), 


where in the last equality we've used the fact that 0,ju4 = 0. And this last integral is (by the divergence theorem) 
— te dA-j over the boundary A, which can be intuitively thought of as “only caring about the flux of our current within 


our piece of spacetime.” 


Example 8 


Consider the Noether charges corresponding to the translation x!’ = x# + a4 (where a is infinitesimal). 


The corresponding Lagrangian transformation is then 
L(x) H L(x — a) = L(x) — aan L, 


so that 6£ = a4O,£L = adr (nL) (note that we'll regularly switch between 7 and g here, and 7” is always the 


“constant” (+,—,—,—) metric). Inserting the metric here is a “common trick:” the variation in a field @ is then 
6¢=—aO,d, 60,6 = —a"O0, 0,0. 


Collecting terms and putting them into the definition of the conserved current, we get 


OL 
O(O,) 


where the second spacetime index v indexes the different currents we get from the four different translations (for a 


—— Ob - nL, 


f= 


single transformation we would only have j“) and where we've removed the arbitrary constant a” (we could choose 
it to pick out a particular component). And this j4, that we've calculated is the energy-momentum tensor 7“,, 


which is conserved (though it's not the one we're used to from general relativity, there is some connection). The 


corresponding conserved charge is then 


— 3 Oi a 3 OL 
pm fxr = f ox (saae- re), 


Cho 3 00 __ 3 00 
p= | dxr = | #(aaqoo- ne). 


OL(x) 
O(A0¢)’ 


and in particular we have 


we see that 


and remembering that the canonical momenta are defined as m(x) = 


p° = | x (Ne00- 2) 


is the usual formula for the Hamiltonian (this pg — £ form is called a Legendre transformation). And similarly doing 


the same thing for three-momentum yields the momentum operator (exercise for us). 


Remark 9. /n everything here, we have £ as a function of x, but we shouldn't think of putting indices on it (it’s 


‘always contracted” and won't appear on its own). 


Now that we have our Hamiltonian, we can talk more about quantization, and we'll start by thinking about the 
canonical quantization of scalar fields with our Klein-Gordon Lagrangian 


m2 


2 
ee 


1 
= 5 (0.6)? 


with [] = 


(so that we have I = Oo¢@ in this case). Then the Hamiltonian is 


H= f axn= f a%x(nae-c) = f ax (SP 5(0)? | m6?) 


and now (much like we quantized x and p) we think of M and @ as operators. If we imagine this to be a classical field 


OL 
0(00¢) 


theory in the Schrodinger picture, so that [ and @¢ are only spatially dependent (or we're only looking at a particular 


time slice), then the commutation relations we are imposing (similar to the one for x and p) are that 
[0(%), 6] = (8). NG) = 0, [6(%), NG)] = 15K — J). 


Again introducing Fourier modes (this is a transformation that we can do) 
dp 


: ip-x —ilp 
(2m)3 ,/2u, oa Pf Pratnye —al(p)e it - xX) 


where we require the Fourier space relation ¢'(—p) = $(p) (so that ¢ is real) and w, = \/p2 + m?, and inverting 


these relations by adding in a delta function term [ d?xe!P*11(X), we get 


$(X) = (a(p)e®* + al(p)e- x), 1X) = 


a(p) = [ a°x «(up +68), al(p) = Px (up '(3) ~ (9). 


The commutation relations are all zero except [a(f, a‘ ()] = (2m)?6) (p— p’), and plugging back into our Hamiltonian 
gives us 
H= | doupla' (p)a(b) + a(@)a"(@), 


which by the commutation relations can also be written as 


= | Foun (state) + Zoe. '(0n), 


which looks a lot like the harmonic oscillator with a particular frequency. So we can do the same thing as last time, 
seeing how this acts on creation and annihilation operators to create our spectrum and Hilbert space of states. We 
have 

[H, a! ()] = wpal(p), [H, a(B)] = —wp (8), 


so we do have creation and annihilation operators and we can define a vacuuum state |0) such that a(/) |0) = 0 and 


excited states |) = ,/2w,a'(f|0); we can then define states (which we'll eventually interpret as particle states) 


|Pi, Bas ++ Dn) = \/2Wp, ++ V/2tp, a! (1) +++ a! (Dn) [O) . 


These fs can be interpreted as momenta: indeed, H|f1,--- , Bn) = (Wp, +:--+Wp,) |Di,--> , Bn), and al(p)al(G) |0) = 
a'(@)a!(p) |0) for any two momenta fp, 7 (because our commutation relations tell us that the a! all commute, we have 


bosons). So we can form the states (a!(f))” |0) for any integer n. 


Remark 10. Trying this with “position excitations” instead of “momentum excitations” will be more challenging, but 


we'll discuss this later. 


Taking the momentum operator P4# = [ Px ae OM — g°L from our earlier discussion of Noether’s theorem, 
we find that 


3 
PZ i d?xM(x)0'6(X) = / SayaP'al (P)alP) 


and so we have a number operator N = al(p)a(p). And because [P’, H] = 0, what we learn here is that the states |/) 
can be taken to be momentum eigenstates with energy wg. So the point is that we do a Fourier transform and see 


how the Hamiltonian acts in Fourier space, giving us the usual thing from classical physics. 
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Last lecture, we wrote down the proceedings of how to quantize classical field theories. To review the major steps, 
we started with the classical field theory in four dimensions with Lagrangian density £ = $(Ou)? — mo? to get the 
canonical coordinates [] = ToD) = Oo¢. This allowed us to write down the Hamiltonian density, giving us familiar 
conserved quantities. 

From there, we quantize by promoting our fields ¢, I to operator versions (this seems to break Lorentz invariance, 
but we'll ignore it for now) such that [¢(X), 6(Y)] = [N(%), N(V)] = 0 and [4(X), N(V)] = 16°) (xX — 7). If we then do 
a Fourier transform (mode expansion), using the frequency w, = Jp + m, we have 


apy Wis @ ies By dps 2 25 ai 
X= | — | Jae’?* — atel*) A(X) = - elPX a 4 ai elPX] | 
(More generally, we could have used a and b instead of a and al, but if we want @ to be real we need b = ay 
Requiring that the only nontrivial commutator here is [a(f), a (f")] = (27)26) (7 — f), we recover the Hamiltonian 
for a harmonic oscillator at every point in momentum space. So we can find similar energy eigenstates by starting 
with a ground state and applying creation al operators to them, and |p) are also energy eigenstates of our Hamiltonian 


with a definite momentum. 


Remark 11. /t turns out that the vacuum state is not always unique — it will matter if we have something called a 


“topological excitation,” This comes up in various applications but is far beyond the scope of this class. 


All of the field theories can be quantized in this kind of way, but there’s a lot of clean-up that we're going to 


10 


have to do first. In particular, we chose a particular time-slice t = O for all of this argument, which breaks Lorentz 


invariance. 


Before that, we'll discuss the bizarre vacuum energy of this Hamiltonian. We can rewrite this Hamiltonian 


3 
f= / aan [al (P)a(B) + a(P)a"()| 


d°p TR) a(R otro 
om? [2at (a) a(B) + [a(p), at (a)]] , 


and it’s now a problem that this commutator is a delta function, since 6(3)(0) is “infinite.” Furthermore, integrat- 
ing that delta function over d?p gives us another infinity. These integrals actually end up telling us about what's 
“going to haunt us with quantum field theory,” and we'll try to talk about them now. The f d?p infinity has to 
do with extremely high energies and momenta, and we call that an ultraviolet singularity. On the other hand, 
usually we have to specify how delta functions come about to get a smooth approximation, but here because 
we did a mode expansion we know that 6°)(0) = f d?xel(®-P)* = f d?x. So that infinity comes from distances 
that are very far apart (large volume) or momenta p —> 0, and that’s called an infrared singularity. Relatedly, 


there is a collinear singularity if we have multiple particles with the same momentum p = fp". 


But the point is that whatever this “infinity term” is, we'll set that as our baseline energy Eo, and we always 
measure energy values of one state with respect to another and look at H — Eo. (And there isn’t really any 
way of getting around this — as far as we understand, there aren't ways to measure this Eo, but we're somehow 


saying that there is an “infinite energy density everywhere.”) 


Remark 12. Notice that in momentum space, our Hamiltonian looks like H = (wp@ — iN)(wp_pb+ iN), and if we 
impose commutation relations on this Hamiltonian directly we instead get ata with no additional infinity term. 
So we can think of this as the true Hamiltonian of nature, but there’s no real physical difference in the two 
cases because we can always add a constant to H. So this is not something we can really resolve, and it's not 


something we should worry about much. 


One tool we do have to help deal with infinities is the concept of operator ordering (which is denoted by putting 
: before and after the operators). Specifically, we always put annihilation operators to the right and creation 
operators to the left: 


. tT. + ata - = of 
> Apa,» =: Apap : = apap 


In particular, this means our normal ordered Hamiltonian is given by 


d?p i 
SA y= i: (om y3 we 09%- 


(We're not saying that we can ignore commutators in general — it’s just that in this particular case it gets rid of 


the zero-point energy.) 


Our energy doesn't seem like it should be relativistically invariant, and indeed if we look at our eigenstates 


|B) = \/2Epal (5) |0) , 


we have 


(7, l=) V/2Ep/2E, (0|a(a)al (|0) = V/2Ep /2E,5° (B — G) = 2E(p)5 (B— 4). 
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But now if we do a Lorentz boost in the x-direction with coordinates 


Pe = (Px +BE), = YWE+B2) => E=(E'- 98), x = (0 - £8) 


(still no time-dependence here), we find that the inner product above can be simplified via 


E(p)6 (B— G) = E5(px — ax)5 (py,2 — ay,z) 
= ¥(E' — pi.B)5(y(pl — ELB - di, + E1B))6°)(py,2 — dy,z) 


VCE" = 'p.8) 


= Ge anp yee F-%). 


where in the last step we've used that 6(cx) = 1 O(x) for any constant c (filling in the details for the Jacobian 
is a good exercise). So the point is that we end up with £’6(3)(f" — q’) again, and Lorentz invariance is still 


present. 
So if we think about the process of “creating particles,” we can have $(x) act on the vacuum to get 


Bp 1 


(x) |0) = (on)? EH) EP pis 


d? 1 Zaks 
5; ———al (p)e™* 0) = 
p 


(27) «(200 
We can then rewrite this all as 


Z / Copy (2704 (0? — P)el®* |p), 


where we define 6, (p? — m*) = 6(p* — m?)©(p°) where © is the Heaviside function. (Basically, we're picking out the 
Po Component that gives us the energy, because there's a positive and negative solution for po that is picked out by 
the delta function in [ dp°5(p3 — p? — m?)©(p°), and we only want to take the positive one.) This will be interpreted 


physically as “creating a state of X:" to make analogies to quantum mechanics, we can calculate that 
(0/4(%)|5) = e'?™, 


which is the position space wavefunction (since we're still doing everything at t = 0 here), and similarly 


(010%) = f GAab (0? — mye (a 


can be thought of as “annihilating a particle at xX.” And because everything is contracted nicely, there’s no problems 
with Lorentz invariance here. (And in general, we often think of things as momentum eigenstates, so we shouldn't 
think too much about the particles actually being localized at X — that’s just a label.) 

But we still haven't talked about time-dependence up to this point, and now we need to figure out how to bring 
it back into the picture. Recall that in the Schrodinger picture, our operators are static, while in the Heisenberg 
picture, operators involve in time. And for our purposes, it makes sense to take the latter and define our operators to 
evolve in time as 

Ox) = 0f%, 1) = eOC oe", Ti se" NGA ole="*. 


In momentum space, this looks like 


d3 1 
o(%) -| oe 4/ 2Wp fer tiae) ae 


where we now have that px are contractions of momentum and position four-vectors (rather than three-vectors), and 
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the time-dependence is encoded in 


eae =. e ECP) a( py, 


We then can make sense of the expression for 
dep. ih". ag 

3 pr (e 
(21)3 ,/2p0 


We then find that (taking another time-derivative) 


N= 6 = 


IP a — elPX at) 


where (p°)? = p? + m? = —0? + m?. Pulling that operator out of the integral, we thus have 
—Ob= (—03 +m)o => (0% + m*)o(x) = 0, 


which is our original Klein-Gordon equation. And notice that what we've managed to do is to create negative frequency 
states (which will turn out to correspond to antiparticles) which still have positive energies. This is not very cool for 
scalar fields because they will end up “being their own antiparticle,” but we'll see soon (in exercises) that this does 


manifest in a cool way. 
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Last lecture, we followed classical quantization steps and introduced operators in the Heisenberg picture. Specifically, 
we defined (here x is a four-vector) 


(x) = O(%, t) = e* G(X, 0)e™, 


which (for the Klein-Gordon Hamiltonian) yields 


3 
60) = | Eee (ealo) + ™al(0)), 


Then the definition of conjugate momentum I(x) is really the time-derivative of our field 6(x), and we can calculate 
that Oo = 0$¢ = 0;)¢ — m*, getting back to the Klein-Gordon equation. We then found our spectrum and found 
physical interpretations of “creating particle states.” Defining |p) = ,/2w,a'(p) |0), we can write 

dp 


gS Pog I Px 
(21)? 2p Z Ip) 


(x) |0) = 


which allows us to create and annihilate particles “at a particular state” x. 

One thing we didn't clean up last time was the problem of causality, and in order to do that we'll study the vacuum 
correlation function 
Bp dq 1 
(21)3 (21)? \/2Qwp \/2tq 
where we've used the fact that the other term in (x) goes away because applying a to the vacuum gives us 0. Applying 
the commutator (p|q) = \/4wpwq (0|a(p)at(q)|0) = 2wp_(2m)?5 (p — q), this simplifies to 


dp dq 1 
(21)? (27)? 2p 


(0|6(x)6(y)|0) = 


epxtiay (0]a(p)a'(q) |0) : 


(0|6(x)6(y)|0) = 


: d+ : 
eiP(x-y) = / Tome ZrO (0" = m?) e/PO-Y) = D(x —y), 
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where D is what we call a Green’s function. But now if (x — y)* < 0, meaning that our distance between x and 
y is outside the light-cone (we have spacelike separation), then we can calculate this function D(x — y) by using 
Lorentz invariance and noticing that we can do a transformation so that x and y are both chosen to be at the same 
time-slice. So we'll set x° = y® = 0 and get some radial vector 7 = X — y, and (exercise) we can check that 
D(x -y)= eas tee dial aae and in particular this falls off exponentially as r + oo. So the correlation is 
nonvanishing, but that doesn’t mean we have any issues with causality since we're not communicating any signals. 
Instead, the question is to ask whether we can impact measurements at x using measurements at y, and we do this 


by looking at commutators (because they tell us whether the order of measurement matters) 


(0166+), yO) = fC). Ay] = ff Pr 2m8..(p? — m2) [ei@O) — erin» 


(as an important note, we set up equal-time commutation relations [6(x), d(y)] = 0, but the quantization is different 
if we're not doing things at a fixed time. So we have to actually go through the calculation here). We'll now undo the 


Lorentz invariant notation and rewrite in a more complicated way to get 


, 


3 
/ ap _ 1 [e ip(xy) _ @-(—iwu(p))(°-y) —1B(%-7) 
(21)? 2w(p) 


Qn 


and then doing a substitution p++ —p and also “picking out the particular value of p°” lets us rewrite this as 


d°p dpo 
(21)? 2po 


(OlLb(x), &(y)II0) = fee Pp (eh susp) ie ena) 


(so we're now integrating with respect to d*p again). Now we'll write these delta functions as contour integrals: fix 


some infinitesimal ¢ > 0 and define the integral 


=f dD inv) | ( 1 1 ) 
(274) 2p° \ p?—w(p)+ie p®+w(p)t+ie]' 


where this expression now requires us to have x° > y® as a constraint. Combining these terms yields 


_ f FP ip) / 
(Qn) (P+ ie)? — [BP — oP 


Cauchy’s residue theorem tells us that for any contour C enclosing some set of poles {z;} of a function f (meaning 
that f looks locally like o2)) we have 


¢ f(z)dz= 2ni Ss” f(z). 
7 {zi} 
We're actually only doing this contour integral in the p° plane, and the introduction of € will be explained now: we 
have a pole at p® = —Wp, —1€ and at p? = Wp — i€, while the ordinary d*p integral travels along the real axis for p°. So 
now we can “close the contour” and complete our contour C by basically traveling through a semicircle in the negative 


imaginary axis, taking its radius to infinity: 


p° = rcos@ — irsin@,@ € [0, 7]. 


In particular, we see that as r + 00, e/(°-¥")P® = ersin@(x°-y") decays exponentially as long as x° > y°, and the 
length of the arc is only linear in r. Thus the contour integral that we wrote only gets contributions along the real axis, 


giving us the boxed integral /, and using the residue theorem gets us back to D(x — y) (which is basically integrating 


14 


e—'P(*-Y)_ So if we then define the retarded propagator 


d*p i 


(ony (pO TEES BE ee? = (OllO(%), #10) 80° — ¥°) 


Dr(x—-y) = 


and analogously the advanced propagator 


Da(x — vy) = (O|[¢(x), O(y)II0) a(v° — x°), 


we can check that (03 + m?)Da(x — y) = id“) (x — y) (and the same for Dp). These Green's functions are useful 
because they relate to equations of motion if we have our Lagrangian £; and we add in an additional interaction term 
dj (where j is some current), then our equations of motion become (0? + m?)¢ =. And to get a solution for /, we 


can use the Green's function in a convolution 
6) = | dtxDawe— iO) 


(where we use the advanced or retarded propagator depending on the situation); indeed, we see that (07 + m*)@ = 
if d*y — id (x — y)j(y) = J(x), so integrating Green's function (specifically, convolving it with the source) is useful 


for recovering @. 
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The Feynman propagator is defined as 


Dr (x — y) = @(x° — y°)D(x — y) = O(¥° — x°)D(y — x) = (O1d(x) b(y)10) + (OlG(y)(x)]0) . 


(In other words, if x° > y°, we perform (y) first, and vice versa.) 


In operator notation, we'll write this as 


De(x— y) = (O|T {Ox OY) F10) - 


We can give this propagator an integral representation as well: indeed, we see that 


dtp | i 
= y\-= =ID(X=Y)Ju a ew 
DEY) i (2m) © p> — im? +ie' 


where this time one pole will be below the real axis and the other will be above, because (using that ¢ is infinitesimal) 
I 2 ! _ 1 
p?—m+ie (po — ./p2 + m2 — ie)(p0 + /p+m2—ie) (p? p? + m? — je)(p° + \/p2 + m? — ie) 


and we see that this indeed gives us oe ( ). So now the contour integral picks up either one 


1 1 
p®—(Wp—ie) p°+(wWp—i€) 
residue or the other, but it depends on whether our semicircle is in the top half or bottom half of the imaginary plane: 


this is related to whether we have x? > y® or x° < y®. So for the advanced propagator we close above the real axis, 


and for the retarded propagator we close below it, based on which side allows e*/?%—*) to vanish along the circular 
arc. 

With the way we've set everything up, it’s a bit difficult to get rid of time-ordering — we'll continue along this way 
for a while, but we'll see soon that time will disappear from the picture. Everything so far has been free field theory, 


but now we'll talk a bit about interactions: it turns out that what we want to add into our Lagrangian looks like 


ON 
L=Lo- ae O): 
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where A is some arbitrary number. In general, we'll see products of fields (local field operators) like this appearing in 
our interaction term, and we'll see that having more than four powers of fields will not be relevant in most interaction 
theories. (Heuristically, that has to do with energy dimension, but there’s a whole discussion of the Wilsonian renor- 
malization group for that.) Similarly, we can then split our Hamiltonian into a free Hamiltonian and an interaction 
term 

H=Ho+ Hint, Hint = A pdt. 


We can write out a mode expansion of the same ansatz 
Bp 1 
(21)? \/2up 


so that our initial fields will now take the form (this is now the interaction picture) 


0(X, to) = [a(p)e?* + al(p)e”*] , 
G1(%, t) = ell MID(R, toe MOE), 
In the Heisenberg picture, this means 
bu (x) = elH(t—to) e—iHo(t—t0) g(x, fetHolt—t) .—iH(t—-t-0 _ Ut(t, to)@1(t, to)U(t, to) 


for some unitary operator U(t, to) = ef M(t) e/H(t-%) involving the interaction term. We thus have U(to, to) = 1 
and iU(t, to) = e!(t-%) (Hy — H)eW/M(t-) where H — Ho is the interacting Hamiltonian Hint, ; in the Schrodinger 


picture. This then simplifies to 


ef Ho(t—to) py se iHo(t to) ei Ho(t to) eiH(t to) = Hint iU(t, to) = iOgU(A, Ao). 
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Remark 14. When we quantize a classical field theory, sometimes we have products of @s and ms which we could 
originally put in any order, but then the order ends up making a difference when we turn them into operators. It turns 
out tat this just adds a constant to the Hamiltonian (which may be infinite, but that’s not really a problem), which 


never changes the physics and thus yields the same field theory. 


Remark 15. The charge associated with the symmetry ¢ > e'*¢ and ¢* > e~'*¢* is a difference of number operators 
N, — Np; it turns out this will represent the number of particles and antiparticles, and it will turn out to show that 


particles are always created in pairs. 


Last lecture, we calculated the Green's function D(x, y) = (0|¢(x)@(y)|0), finding that (O|[é(x), d(y)]|0) is zero 
for spacelike separated particles (which means causality is not violated). We then used it to write down the Feynman 


propagator 


Dr (x — y) = (O|T{(x), by) 10) = (O1b(x) b(y)10) @(x° — y°) + (O]G(v)H(x)10) O(y? — x°) 


= lim | coe : 
e>ot J (2n)4 p?— m? +i 
which is a manifestly Lorentz-invariant quantity which we obtain by thinking about contour integrals in the complex 
plane. The point is that by solving the free scalar field theory completely, we are now able to calculate correlation 
functions explicitly, and it will turn out that arbitrary correlation functions can be written in terms of this quantity 
Dr(x% — x). 
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But what we've been discussing recently is that we “turn on interactions” by making our Lagrangian more compli- 
cated, varying the original Lagrangian $(O.6)? — sme by adding a term —Ag* for some very small A. We did the 
same quantization procedure: doing a mode expansion and thinking of a and a! as creation and annihilation opera- 


tors, then converting to the interaction picture ¢)(X, t) = e!('-%) OX, to)e/)(t-%) | and we find that for the full 


time-evolution we want the Heisenberg picture with the time-evolution operator 
U(t to) = el Ho(t—to) e—iH(t- to) 


where Ho is the original Hamiltonian and H is the full one. What we're going to do in the next two lectures is find 
a mathematically useful way to calculate correlation functions, arriving at a framework (Feynman rules) that lets us 
do this mathematics quickly. The point is to relate correlation functions like (0|¢4(x)@y(y)|0) to things that we can 
already calculate. 


Last time, we found that we could evolve that time-evolution operator U via 
iAU(t, to) = ef(™) (Hy — Hye Mt) 


(where we've canceled out an e~/4(t—t) @/Mo(t—t0) term, and this right-hand side is basically acting with the interaction 
Hamiltonian in the interaction picture Hip,;U(t, to) (where Hint = H — Ho). This differential equation is a bit tricky, 
but we'll rely on Dyson series for this and do perturbative calculations in \ (which we're assuming to be small). Let 
H, be the interaction picture Hamiltonian for the interaction picture, so that we can make the ansatz 


t 


U(t, toy) =C- if dt'H,(t')U(t’, to) 


to 
where C = / because U(to, to) is the identity operator. Plugging this back into itself, we find that 
t t t! 
U(t, to) =1-if dvHj(e) + (i)? f avH(t) | dt” H,(t”) + O(A3) 
to to to 
and then we can keep repeating this process. But we can also make use of the fact that 
Us th t t 1 t 
[at [death ertitte) = ff dte [ atati(tayin(ts) = 57 { [ atdnmi(aye(e)}, 
to to to ty zs to 


where T denotes the time-ordered product (here we can imagine that to integrate along the square [to, t] x [to, t], we 
can integrate along the regions where t; > to and tp > ty and add them up). So plugging this in infinitely, we find 


that the time-evolution operator is given by (here the $ goes into the series expansion for the exponential) 


U(t, to) =T {eI ses . 


We can check that U(t1, t2)U(to, ts) = U(tr, tg) and that U(ty, t3)U' (te, t3) = U(th, tz), so that U(ty, t2)U' (ty, te) = 1 


(so that we have unitary time-evolution). We can rewrite U also as 
rt rt 
U(t, ty) = T {ete a =i {eS ead 


where L; is the interaction picture Lagrangian because H; = —L, (the free stuff is gone, so the usual kinematics m? 


potential is gone). But the problem here is that we don’t have a vacuum state (because we don’t know how to solve 
the theory exactly). To find the ground state |Q2), we again use the fact that » is small, so that we should expect 


the new ground state and the original one to have significantly overlap. Thus we'll assume that (0|Q) 4 0 and that 
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E; > Eo for all other /. We find that acting on the original vacuum state (and for some time T) 


CO 


ervth loyss Ser |i) (iO) 


i=0 


which we can also write in terms of the new ground state as 


oo 
eh) lo Ses FY |i) 110): 

i=1 
Taking the limit as T gets large, specifically setting T — oo(1 — je) (this imaginary part is like preparing a quantum 
dot in a particular state and having it interact with a noisy environment, so that we get exponential decay at long 
times), we only get the most stable state living the longest, namely the lowest-energy one. Thus we can throw away 
all contributions in this last expression to get 

iEoT 


Ove". 45 
en Bag) 


eTlAT 0) : 


Doing a transformation T + T + to, we can now rewrite this ground state as 


|) _ lim (Qo) e((T+t0)Eo)—1 e-iH(T +t) @iHo(T + to) |0) 
T—oo(1l-e 


where we're inserting a unit because Ho is the free-field Hamiltonian and e!(7 +) |0) = 1]0). But this means we 


actually have a time-evolution operator here, and we find that 


|2) = lim —_((Q)0) eT +E) (4), —T) 0) 


T-00(1-e) 


so that complex conjugating and replacing T with —T yields 


(Q]=_ lim _((0|Q) e7 (7 +)Fo)—1 (Q) UT, to) 


T-00(1-e) 


(Everything here with the vacuum states is in the Heisenberg picture.) And what we want to calculate now is correlation 
functions of the new ground state: we have (converting to the interaction picture and plugging in our expression for 
22) 


(216) 6(Y)12) = [tm (e107) (0}2))  (O|UCT, to) U(to, x) bux) UK, y)di(YULY®, 22)U(to, -7)|0) 


(e‘Fo(@+7) (0)) ‘| | 


To make this look nicer, we'll normalize our vacuum state, which means that 
(212) = 1 = tim (| (02%) Pe*27)  (O|UCT, x° UCR, yYIULY>, fo)U(to, -T)]0) = (O|U(T, ~T)|0) 
oo 


which means that 


(QP)OWIM (OUT. xo UC®, Y)bi(YUY?, =T)]0) 
; | 


T-00(1-e) (O|U(T, —T)|0) 


which is nice because the overlap now only involves time-evolution operators instead of the exponential terms. And 
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this can in fact further be written in terms of a time-ordered product as 


if d*xL, 
Fee eee ok a 0 


But the point is that in the interaction picture, we can calculate everything with respect to the free Hamiltonian, so 


we can calculate the right-hand side. Similarly if we want to calculate an arbitrary correlation function, we have 


tye a if d*xl, 
ee oe i) He 


We'll see next time how to calculate this and write something more concrete down. 
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Last lecture, we analyzed interacting theories and discussed how to calculate correlation functions in the interacting 
Hamiltonian vacuum state |Q). Basically, we found that if we assume the overlap (0|Q) between the free vacuum and 


interacting vacuum ground states, then we could write the correlation function 


(0|7 {4)(4)-+- diane 2 blo) 
ee ae (O/T fel #£/} 0) 


(where we may add a time-ordering on the left compared to what we derived last time because the expression is 
independent of the ordering of the xjs). The point is that we can now calculate in terms of the free Hamiltonian, and 
we're going to see today how to make use of Feynman diagrams to avoid doing too many calculations, starting with 
Wick’s theorem. We need a way to evaluate something like (0/7 {(x1) --- (Xn) }]0), and we should recall that we 
defined 


e7!Px +4 al elPX), 


o= ls (2m) an 
which we can decompose into two terms @_ and $4, corresponding to the two exponentials. Then we can define the 
fields A= (x) = At +A and B= ¢(y) = Bt +B”, and the time-ordered product, if we assume x° > y?®, is 


T{AB}losyo = AB = AT Bt + AtB> +A°Bt+A-B™. 


Having this operator act on the vacuum state means many of these terms will drop out, and this is almost a normal 
operator except for the A~ Bt term. But if we introduce the normal ordering (recall that this is where creation operators 
go to the left and annihilation operators to the right) : BTAt : = A*B™ and observe that ATB* = [A~, Bt] +BtA, 
then we will find that 


T{AB}|xo>yo =: AB: +[A, eal cere 


In the opposite ordering we find that 
T{AB} | yo>xo = : AB + [A‘, BT ||xo>y0- 


But now because we can directly 


(O|[A, B]|0) = (0|(A* + A-)(B* + B-) — (B* + B-)(A* +A )|0) 
= (0|A~B* — B-A*|0), 
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we can put things together to find that | 7{AB} =: AB : +Dr(x — y)|, where De is the Feynman propagator, so 


we get a relationship between the time-ordered product and the normal ordered product. (Remember that we like 


normal operators because they have expectation zero in the vacuum.) More generally, we get following identity: 


Theorem 16 (Wick) 


For operators A = $(x,), B = $(xs),--: , Z = O(Xz) (or generically any number of operators), 


THABG- XV) = ABC XYZ Dea = xa) 6 KY Ze DO =) Bee XYZ ee 


+De(Xa — Xp)DeF(Xc — Xp): EF---XY¥Z:+---+ De(xX4 — Xg)--: De(xy — Xz) + °°: : 


In other words, we contract pairs of variables in all possible ways into Feynman propagators and sum over everything, 
and we don't necessarily have to contract adjacent variables — all possibilities allowed by the combinatorics contribute. 
And now notice that if we take the expectation of this operator in the vacuum state, everything drops out except when 
all of these are Feynman propagators (since normal operators annihilate the vacuum) — for example, that means we 


just have 
(0|T {ABCD} 0) = De(xa — XB)De(Xc — XD) + De (Xa — Xc)De (XB — XD) + De (xa — XD) De (XB — Xc) 


for any operators A = $(x,), B = $(xs), C = (xc), D = o(xp) (and note here that the Feynman propagator is 


symmetric). Graphically, we can imagine drawing out the possibilities as shown below: 


A B A B A B 
C D Cc D C. D 


Remark 17. Notice that if we have an odd number of operators, these expectations will always be zero, because we 


will always have a leftover normal operator which will annihilate the vacuum state. 


We can now return to the correlation function and calculate expectations, and we'll do this with perturbation 


theory. Recall that our interaction Lagrangian £L; = —A 94(x) is small in A, we want to calculate the denominator by 
expanding out the exponential 
id f d*x¢4 ir 444 
(|r {e a Vo} =1-27 o| | d*xd*(x) 


°)} 
~(i? 


HET [atx [av (oletcoo'unioy} + 008) 


Remark 18. We're now going to assume that taking integrals and computing expectations can be interchanged, so 


we can move the integrals outside the time-ordering. 


Applying Wick’s theorem to ¢d¢¢, we see that (0|T {6*(x)|0) = 3D¢(x — x)De(x — x) (we can imagine three 
times the diagram . in which we contract the point x to itself twice). The next term is then 


(0|T{6*(x)6*(y)}|0) = (O|T {bx bxdxbxby by by by|0) F 


and doing the combinatorics turns out to give us the following picture (where the dots represent x and y), where it 


depends on how many times we contract xs to ys versus within each variable: 
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9 to 


(We can work out the details on our own.) On the other hand, the numerator can also be expanded out in powers 
of A. The O(A°) term is (0|T{O(x)o(y)}|0) = De(x — y), the O(A!) term is 


(or {wow (-3) [atze*ca}|o). 


and we can apply Wick’'s theorem again. This time the combinatorics works out to 3 times De(x—y)Dre(z—z) Dr (z—Z) 
and 12 times De(x — z)DF(y — z)De(z—z), and then integrating that over d*z and multiplying b —p. So putting 


everything together, our numerator looks as shown below: 


x y Ir ni x y 
At A pate | 38 t Sedzsta gS 


| +000”) 


Thus we've expanded both the numerator and denominator in small As, so we can do a geometric series expansion 
and keep the leading order term. Putting everything together, we find that the correlation function we wanted is (here 


the 12 cancels out mostly with the 4! in the denominator) 


(Q\T {bu(x)Ou(y)fIQ) = De(x — y) — Sf dtzdr( — z)Dr(y — z)Dr(z — z) + O(A?) |. 


(It turns out that the disconnected diagrams (or disconnected graphs) in which z is not connected to the endpoints 
X and y are going to be problematic, because we cannot integrate D2(0) over d*z. So consider the set of all 


disconnected contributions and call them V, enumerating the different graphs as Vj, V2,---. (For example, Vi, could 


be the diagram or It will turn out that in general we always get total contributions of the form 


1 
nj 
(connected part) I] V, nl’ 


so adding up all such contributions 


1 al 
ae n bo eae nj 

y y (connected contribution) [] V, ha y (connected contribution) [] ( y V, x) ; 
connected graph {nj;} i connected graph i 


which simplifies to 
= S- (connected contribution) e2'. 


connected 


So if we apply this to our time-ordered product and want to evaluate something like the numerator 
(o|T{bii(yye'l Jo), 


then the expectation will be the sum of contributions over all connected diagrams (the two graphs we saw for De(x—y) 
and De(x — z)De(y — z)De(z — Z), as well as other terms), times the exponential of the sum of the disconnected 
diagrams like De(z — z), De(z — z)De(z — z), and so on. And the denominator is particularly simple because 
there are no connected diagrams, so we just have the exponential of the sum of all disconnected diagrams. But 
that means that when we divide, the exponential “cancels out,” and thus if we wanted to calculate something like 
(Q\T {by (%1) +++ 4 (%)}|Q), we just sum over all connected diagrams with n external points x;,--- , X;, with as 


many zs as we want but with each z coming with a power of A (and for a theory of the form im f d*zo*(z) we may 
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have four lines to connect from z). So we can now just draw diagrams like the one below and get a A° term, and we 


want to understand what the prefactor is corresponding to it. For example, consider the following diagram: 


98 | 


This term corresponds to trying to calculate one of the possible types of contractions in the term 


. 3 
/ Azd*ud'w a (-3) (0|o(x)d(v)(z)*o(w)*4(u)4.) 


in which we need to figure out the number of ways to contract terms to get all of the edges that we drew above. 
But we can just calculate the combinatorics from our diagram — for z we get 4-3, for w we get 4!, for u we get ©, 
and then we can exchange z, u and w and get the same shape diagram up to relabeling so we get an overall factor 


of 3!, and multiplying everything together yields ay which mostly nicely cancels out with the prefactor before the 


integral. So we see that the overall prefactor for this Feynman diagram is iy But we can make this process more 
systematic, deriving easy rules for symmetry factors (which are different for every interacting theory): 
+ Every propagator connecting a vertex to itself gives a $ factor. 


1 


+ Every n propagators that connect the same two vertices give us a factor of =. 


¢ Exchanging vertices without changing the diagram yields a factor of 2. 


(And there are programs like QGRAF that help us do these kinds of calculations as well, giving correct symmetry 


factors and statistics. ) 
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We'll start by summarizing what we've been doing in the last few lectures: we're working with an interacting system 
with a Lagrangian 
£ = 5 (0.0)? — ¢meg? — Ags 
oe 2 4! 


for some small A. We can write the expectation of the time-ordered operator T{)(x)d)(y)} in the original vacuum 


|O) as an integral which yields De(x — y), and we can then use Dyson's formula for the unitary time-evolution U(t, to) 
to get a formula for the two-point (and in fact n-point) correlation function in the new vacuum state |Q) in terms of 
correlation functions in the original vacuum. We then used Wick’s theorem to write T{@)(x1)---6;(Xn)} in terms of 
normal orderings and contractions, and it turns out we end up getting a sum over all connected Feynman diagrams 


(since the expectation of any normal operator is zero in |O), we must contract all coordinates in pairs). 
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Example 19 
If we want to calculate the two-point correlation function (Q|T{oy(x)@y(y)}|Q) in our scalar ¢* theory, we have 


a zeroth order term corresponding to the diagram ~ Y | and then at order » we get 5 (a symmetry factor) 


e 
times the diagram * 7 oe Next, at order \? we get + times the diagram ~ y ; 4 times the diagram 


* = y (there's a distinction between “bubble diagrams” and “tadpole diagrams’ based on how these kinds of 


graphs look), and ; times the diagram ~ 


Remark 20. Physically, we should be thinking of z as a ‘virtual particle” at which the field may oscillate but which we 


cannot be observed. And later, we'll see that this has implications (telling us about resolutions and energy scales). 


Recall that to get to a mathematical expression from something like this, we correspond the line segment between 


two points with the Feynman propagator between those points, associate to each other point z besides x and y the 


mdm 


xX ZY 


integral —idf d*z, and then account for symmetry factors. So the diagram 5 times really means we have 


an integral of the form 


-5 [ atzde(x =VJOrzZ—Z)De2=9) 


in i pe eee) i nq ay=2) i dk ie Be) 
(21)4 p2—m?+i0 J (27)* q?@—m?+i0 J (21)* k? — m? + i0’ 


where +/0 is basically the same as +/e and all of the integrals can be written down in any order. We can collect a 


few z-terms, writing that [ d+ze!(?+9? = (2m)*6(4)(q + p), to simplify the above expression to 


nN d4p d4+q _. d*k 1 1 1 
_A”A —ipx —!9Y (Oq)464) (p 4 / . 
2 (2n)4° | mee ay ee) (27)* k? — m? + i0 p? — m? + i0 q? — m? + 10 


And now this is in a form that motivates Fourier transforming: if we switch out by integrating d*xelP’x f d*yelty, 


we see that in momentum space this integral becomes, after carrying out the d*p and d*q integrals, 


dr [ d*k (21)46“) (p’ + q’) 
2 J (2n)* (k? — m? + 10)(p’2 — m? + i0)(q’2 — m? + 10 


We can then read off the Feynman rules in momentum space from this expression: since the propagator in momentum 


space looks like 
4 . 
BP -iv(x-y) 


Die = 
r (2n)4 p* — m? + i0’ 


we see that a line segment with momentum p contributes a propagator term to our expression, any addi- 


Paes 
tional point included in our diagram again contributing —/A, an outward connection (external line coming into our 
diagram) corresponding to an e~/?* term, and we get a loop integration f ag over all unconstrained momenta. 
(And we keep the same symmetry factors as before.) For example, with just the vacuum diagram with incoming 
momenta p and q from the two sides, we don’t have any unconstrained momenta so we don’t have any integrals, just 
eX lay a (20) 45M (p+ q) (with the propagator term only in one direction, and the delta term corresponding 
to momentum conservation). 


But now we can specify what we want to calculate, which is the S-matrix (or scattering matrix) — the probability 
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transition from the incoming state to the outgoing state. Specifically, we consider the initial and final states 
/) = lim |(t)),  |f) = lim |6(t)) 
t+—o0o too 
(we can imagine wz as being some set of particles, and ¢ as some potentially different set of particles), and we define 


Sri = (f|S|1) (FUT, Tlf) , 


= lim 

T—00 
which is the probability amplitude for a transition |/) — |f). Here, remember that U is a time-ordered exponential 
which can be written as 


co love) 


U(00, —00) =a | dey f dtyT {Hj (ty) --- Hy (ta)}, 


n=0 Z 7 


where because U is unitary we aren’t gaining or losing probability. We can write the scattering matrix element as 
Se = Oe (ro (pp = Te SA + TT, 


where we often write the 6 function as 1 (it’s in some sense a “unit”) and T is called the transfer matrix. Then it 


turns out that (we'll prove this later) our computations are made easier by the fact that 
Sri = (FIS|) = 0 (FSI )|o 
but where on the right-hand side we only sum over connected diagrams that are “amputated” (see below): 


Example 21 


For 2 — 2 scattering, in which we have p; + Po > p3 + Pa, we have the matrix element 


Sri = V/2E1/2Eo/2E3V/ 2Es (0|a(p3)a(p4) Sa" (p1)a" (p2)|0) . 


Here the V2E normalization gives us Lorentz invariance, and the idea is that we prepare momentum eigenstates 
for p, and po and also for p3 and p, and see if they are related using the S-matrix. And we modify our Feynman rules 
slightly, saying that we won't connect propagators coming from the outside in an amputated diagram, so that instead 
of a Feynamn propagator term of the form e~/°* we do not have any propagator from p; and p> in our scattering. 
And now the probability density of a process i + f occurring is P_s¢ = | (F|S|/) |?. 

But to get from these plane waves of particular momenta to an actual situation where we have particles colliding, 
we'll prepare wavepackets which we want to scatter into each other. Specifically, our initial state can be 
d?p; d?po 
(21)? (21)? 2p, Wp» 


If, fa; 1) = f, (21) f2(P2) |P1, Po) . 


where we can write down (for example) a “molded Gaussian wavepacket” with momentum-space shape given by the 


function {82 = —1_ e-i(P-k)*/AP* with Ap < |p|, and we normalize so that Sli? = 1. Then the probability 


QW. ~  (mwAp)3/2 
amplitude looks like (since we're integrating over the square of the S-matrix element, we have the matrix element 


times its conjugate) 
_ f Pr Pps Pp, dp, 1 
(21)? (2m)? (217)? (217)? (2p, )(2Wp, )(2u}, )(2wp, ) 
(2m)*5 (pr + po — pr)(2m)*5 (ph + Py — pr) (Or. PalT IF) (FIT |p, P2) 


Por = | (FIS|A, fa: 1) |? 


f, (p1) f2(P2) fi (P1) f (p>) 


(where the *« denotes complex conjugate).We can then convert by Fourier transform again, writing 276(4) (pi + 
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Py — Pi — Po) as ff d4xel(e+P2—Pi-P5)* and for each momentum variable we can convert with an integral f(x) = 
dp) 1 en ipi ; joo =s . 
ri (2m)? Jam, * (Pie ‘PX where (here is a physics statement) we are saying we can write p; = p; + O(Ap;) because we 


have narrow wavepackets. (The point is that the wavepacket will peak at a particular momentum, so the contributions 


with something like (f|7T|p1p2) will only come primarily from (f|T|p;p2).) We thus find that 
A)F A(x)? a _ 
Prix f dbx TINT (on)*8 (or — Br — Ba) (FIT IPPs) P + O(0%). 
Wp, 2Wp, 
(And we're only looking at a particular space-time coordinate x, corresponding to the local interactions of the scattering 
process, so we only need one space coordinate. But then we integrate over all of space because interaction can happen 


anywhere.) Notice that we've switched from S to T during this process, and that’s saying that we don't care about 


the “unit 1” contributions to the scattering amplitudes because that is the case where nothing happens. So we can 


dP, 
Prax f atx ae 


and think of the integrand as a scattering probability density. We'll see next time how to relate this probability density 


write this final expression we get as 


to something that we can actually measure! 
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Last time, we defined the S-matrix for scattering, which is one of the main tools for doing calculations and under- 
standing observables. Specifically, we found that if our initial state is |/) = limt4~..|W(7)) and our final state is 


|f) = limt+oo |W(T)), then the matrix element we care about is 
Sri = (F|S|) = (F[U(o0, —00)|!) 


which we claimed is just 9 (f|U(o0, —co)|/) 9 (the expectation when we look only at the connected and amputated 


diagrams. 


Example 22 


Consider four-particle (2 —> 2) scattering, in which particles of momenta pi, P2 become particles of momenta 


P3, P4. 


We can write down the matrix element in terms of our momentum eigenstates as 
4 
Sri = II Vv 2E; (0|a(p3)a(pa)Sa! (p1)a"(p2)|0) , 
i=1 
where if we have a small coupling constant in our interacting Lagrangian term —7 Ard", we can do perturbation theory 
S = U(co, —00) +T fel eert =1 + f axe; fuse 
and find that this simplifies at zeroth order to 
4 
Sriloiar) = Il 4/ 2E; (olararalal|o) = (2m)°(2E1)2(E2) [o(o1 — p3)5°) (pr — pa) + 6) (pr — pa) (p2 — ps)] , 
i=1 


where the idea is that a, can either pair up with a3 or aq and a> pairs up with the other one, and here because 


Eps Vp +m? conservation of momentum also means conservation of energy. But we care much more about the 
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terms where we do see something happening (scattering), so we're not going to care about that leading order term 


0). 


Again we need to pair up our creation and annihilation operators until we get to normal ordering. We have a few 


and now want to study the next order. We have 


4 
Sriloar) = Il af 2E; (0 
i=l 


aaa f ax (-z) bxbxbxdxal al, 


calculations that we need to do for this: 


+ Letting a3 = a(p3), we have [a3, ox] = f On TE (e-'™*[as, ap] + e!*[as, abl), where the commutators are 0 
p 


@!P3X 


V2E3° 


and (2m)?6°) (93 — B), so doing the integral gives us 


¢ There are other Feynman diagrams we can draw too — basically, we should imagine that we have 1,2,3,4 
connected with two lines (either 1— > 3,2— > 4 or 1— > 4, 2— > 3), plus either an additional loop attached 
to one of those lines (each of those has $ symmetry factor), or an additional disconnected loop (each of those 
has a ; symmetry factor), or the two lines can intersect. But for the S-matrix element that we care about, we 


only want the connected, amputated diagrams, which can be described as shown here: 


Pr P3 


p2 P4. 


And the point is that we only care about the contributions within the blue part when we say “amputated.” 
(For more, we should look up LSZ reduction, which we'll come back to in a few weeks.) Here, remember that 


“connected” means “connected to one of pi, Po, 23, Pa.” 


So it turns out that contributions to Sr; of order A only come from the case a , where the center point 
represents the spacetime point x (and then the contributions of order are where we have two external points instead 


of just one, and so on). This diagram contributes 


ix(p3+pa—p1—p2 


4 
e 

=i QE; | d* = —j\(20)*6 2 a) 

Cov if eee = ~iM2n)*8 (D1 + pa — Pa ~ Ba) 


so that we really have 
Spi = 1+ i(20)*6 (p; — pF)TFi, 


where Ty; is just —A. And we could have used Feynman rules for matrix elements to get here faster: internal lines 
give ou external lines give us 1, every external vertex with four lines coming out of it gives us a —/A, we get an 
f eb for every unconstained momentum, but we must have momentum conservation at every vertex, and we need 
to think about symmetry factors. So imagine now that we have the diagram as shown below (where 1, 2, 3, 4 just 


mean 1, P2, 23, P4, and there is some momentum k going up and momentum k’ going down in the loop): 
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le— +—03 


2e—+ >—04 


Then the Feynman rules give us p4 = py + po — p3 and k’ = k+ po — pg = k+ pi — pz, so the contribution to S¢; 
will be (the complex number) 
(—iA)? d*k i i 
2 (2m)* k2 — m2? + 10 (k + po — pa)? — m2 + 10° 


And remember that our Feynman rules need to look different for different interaction Lagrangians: if we had 
something like Ag, then each external vertex would need to have three outgoing vertices instead of four. The point 
is that we're abstracting away calculations just by reading off what £; looks like. (And we can take a look at the 
FeynRules package for assistance too — this is meant to be systematic.) 

Last time, we went a bit further and looked at probabilities P,.¢ = | (f|S|/) |? of actually going through this 
scattering process. (This is a real physics question we can ask — we want to find the probability that particles of some 
given momentum will scatter and give the desired output momentum.) If we have an incoming wavepacket 
dp, 1 dpo 


f,, fin) = 
If, fe, in) (2n)3 2E,, (2m)3 2Ep, 


f,(P1) f2(P2) |P1, P2) . 


where for example f(p) could basically be a peaked Gaussian with some width Ap, then doing some algebra and 
plugging into our scattering matrix shows that the probability of scattering is given by 


f ere lSboF lecor 


4 s(4) - _ 2 
Ditls,  Dlting (21 )"0"" (Pp — Pi — P2) |(F|T |P1, P2)|° + O(Ap/p), 


and taking this integrand we can think of it as a probability density (per spacetime point / time slice) of scattering 


an initial state into a final state. 


Example 23 


For example (to draw an analog), if we have two “screens” of particles approaching each other of velocities v1, vo, 


this density looks like oF = |v, — v2|91020, where v1 — v2 tells us about the rate of potential collision and 94, 02 


the density of particles under consideration, so that the scattering cross section @ is 


a 1 Hl? VP con) 45 (pe — pil (F/T pt, po) [2 
|V1 — V2|01P2 2Wp, 2Wp, 


where f; is the probability distribution of finding a particle at a particular spacetime point, so |f,|? = pr. 


We can then write that 


Pp P2 a 
E,E5|v1 — v2| = E1 E> Sy) = |Pi|[E1 + Eo| = /2(01P2)? — moms 
FE, E> 
(this is just kinematics and using that ), = —/> in the center of mass), so that our cross-section looks like 


_ (2n)*5 (p1 + po — pr) 
V (2p1P2)? — mzm3 


So this number basically gives a sense of how difficult it is to hit a scattering event. For physical situations like at the 


|(F|T|p1, P2)|? - 
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LHC, we might see that the number of events is given by N = Lo, where o can be calculated from first principles 


and L is the luminosity (an experimental parameter) given by ana , where Apeam IS the cross-sectional area of the 


intersection beam of particles and f is the frequency. 
But the point now is that even when we have an intersection event, many different things could happen. From a 


probabilistic point of view, what we can write down Is that we have 


Nz 


1 = /iWe 2 
= [ ae Gene = m?)(2m)*6 | pr + po — S G | Kq1.--+ . Gn.|T [Pr P2)| 
2/(2p1P2)? ~~ mem states s (27) 7 j=l 


where N, is the number of particles in our final state s (so we just integrate over all possible momenta that the 
outgoing particles have, and we look at the cross-section that would be produced from such a scattering), though we 
need to double-count If we have identical outgoing particles and add a Bose factor + whenever we have n-fold identical 
particles. (This is called an inclusive cross section.) But it’s interesting for us now to think about differential cross 
sections — if we want to pick out a particular value of the observable of a particular particle and find 6(X—X({ai}, P1, P2) 


(where x might be a particular energy of the 15th particle or something), we need to consider the equation 


da 1 ae. d* qj 2 4 
= 7 54(q? — m?) (2m) 
dx 2/(2p1p2)? — mms states s I (2m)* 


Ns 
54) (» + Pp2—- S- °) | Mp. +p att aNg |?5(X — X({qi}, Pr, P2). 


i=1 
And if we have just a single particle and want to think about its decay rate, then the expression for probability that we 


want to write down is Al \2 
x 
Pra = f tx DOE (anyts(o1 = pr] (ellen) 


PL 


where f d*x|fi(x)|? = 1. Choosing a frame so that the particle is at rest and wy) = m, we find that 


Ns 


1 d*qj ~ 
r= Ss JI (mye oe (GF m?)(20)* - 206) (» ~ y«) | (a1, °** . Gng|7[P1) I*- 


final states s /=1 q=1 


And in particular, we get the expected lifetime of the particle by calculating 7 = z if we know the decay rules for 


different kinds of particles. 
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We've been dealing with scalar quantum field theories up until now (and gotten to a point where we've related our 
formalism to something that we can measure in a scattering experiment), but we'll now change topics dramatically — 
we'll be discussing fermions today, starting with the Dirac equation. Dirac did have something to work with — he knew 
that the Schrodinger equation ido = fy + V(X)w was non-relativistic but definitely useful (for studying systems 
like the hydrogen atom), and so to get a relativistic equation we should try to have an equal number of time- and 
space-derivatives. We know that in the Schrodinger equation we basically have E = g +V, so it makes sense to try 


to work with a linearized version of Einstein's equation. But we see that 


1 5 
E*— p?—-m* =0 = E=V/pim =m a Pe Ap) 
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and the right-hand side has various numbers of derivatives so it’s not going to work as we want — Instead, we'll take 


the square root of our Klein-Gordon equation and write 
—(i0, — m)(i0, + m) = OF +m’, 


which doesn’t quite make any sense because the indices don’t line up, but the point is to use this to motivate an 


ansatz for the first term. We're going to try to construct an equation of the form 
(7° + iy'0; + mW =0 = > (E70 +7'pi + mv =0. 


So to figure out what the ys should be, we multiply by the inverse of Yo to get 


(io + ig *Y'Go + my!) W = 0, 


and then multiplying by (09 (taking another time-derivative) yields 
— 85h = (—iy9g*y'8 + yg") od = (—G 9" V4 H:j + iG Ig *Y'; + iG "G7; — 7 (1q*)? wb. 


where in the last step we've substituted in the expression for /Oow from the boxed equation. So now matching terms 
back in (to try to get Klein-Gordon), the blue part tells us we must have (7 ')? = /, meaning that Yo = Yo -, and the 
red part must go away because it’s not in the Klein-Gordon equation, so we must have {70° 4 =0 = {7,7} =0 
(where {A,B} = AB + BA is the anticommutator). For the green part, we can first rewrite as —y°y'y°/0)0; = 
pyey'¥i0; = y'7/0;0;. Since we have two indices (and we can interchange them), this last expression is the same 
as $(y'y/ + /7')0)0;, and now we want this to be equal to —O? to match Klein-Gordon. Thus {7/,/} = —26). 


We can now collect all of these different relations together, known as the Clifford algebra, 
Ley paso 
We claim that these y4s are matrices, and we can find out a little more about them: we have the trace 


Tela] = Tele? y'] = Trev 9'9°] = Tr 9'9°9"] = Try] 


where we've used our relations in the first two steps and the cyclicity of the trace in the last. Thus 7’ is traceless, 


and similarly we can check that 7° is traceless. Furthermore, because y°y° = / all eigenvalues of 7° must be +1, and 


because {y',/} = —26;; we must have (7/)? = —1 and thus all eigenvalues of y; must be +/. Finally, we want our 
Hamiltonian H (whatever is on the right-hand side of our equation) to be a Hermitian operator. Thus we must have 
—i9¥'8; + m7? = gto; + om), 


and all of the eigenvalues of Yo are real so an = Yo (this is like saying the mass term shouldn't change). On the other 
hand, matching the other term tells us that y/o = —y°y' = 77° (because y° and y' anticommute), and the point is 


that we get y°y47° = (y#)t. Additionally, because the trace of -y° is zero, y° must be even-dimensional (since each 


eigenvalue is +1). If we try two dimensions, we need four complex 2 x 2 matrices, and it makes sense to use the Pauli 
matrices to form a basis {/, a'} — unfortunately, this doesn’t work because the trace of / is 2. So we must try to make 


the y“s four-dimensional matrices. Recall that the Pauli matrices explicitly look like 
0 1 O -/ 1 O 

oi = » O29= ; 2S ’ 
it ig eli 0 lh. Se 
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and from here we can write down a basis (the first expression in block form) 


10 0 
=|, ee 
0 -/ oo 4 
OO a, 1 


along with 


F 0 Gj Pe 
y= i =O © 102; 


this is known as the Dirac representation. An alternative representation we could also use is 


Oo! 
= 
; 0 


this is known as the Weyl or chiral representation. The point is that either one of these will satisfy the Clifford 


=0; 0 


ee pat “| <0 a 
= 1 “Y= = 70 & 102; 


algebra (as well as many other examples), but we can just pick any representation that works and we'll get the same 


physics. But summarizing our discussion, we have an equation 
(iy°Oo + iy'Oo — m') W(X, t) = 0, 


which we can further simplify as 


(id,y" — m)p(X, t) =| (iB — m)Y(X%, t) = 0}, 


where we use the “slash” notation A = y"A,. We can now test and see if what we've done makes any sense — we take 
0 = (if — m)¥ and multiply it on the left by (—/@+ m), and we'll find indeed that we get (8? + m?)p = (02 + m?)yp, 
so we get the Klein-Gordon equation back again. The point is that we've managed to introduce new quantities in a 


way that looks like the Schrodinger equation, which is what we wanted! Furthermore, we have 
1 
00 == Bit VOLO, = mmenen — om 


We now want to write down a Lagrangian, because once we have that we'll be able to apply what we've been doing 
in the class so far. We may first try to write 

L=p'(id — m)v 

<< 

so that we have a scalar Lagrangian, but we find that £1 = wi(—iy° B y° — m)w, where the left arrow means @ acts 
on the left instead of the right, which isn't quite what we want. So we'll have to try something slightly different — we 
can define w = Wiy® and write down 

L= Pid — m), 
so that = 

Li = ph (—ig? By — my"; 

here we've used that p' = (wiy?)t = (7°)i = yw. So then by using integration by parts because we can add a 


total derivative to the Lagrangian, we end up with 
Li=—pid +m)p = Pid — m)y, 


so this is a good Lagrangian to use. And for Lorentz invariance, if we replace w(x) with U(A)w(A~!x) (the same 
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transformation as for scalar fields), wi will be sent to Wi(A~tx)UT(A). The mass term of our Dirac Lagrangian then 
tells us that we must have 
UTA} U(A) = 7°, 


and the derivative term says that 
UTA) PAHO’ U(A) = 7°8. 
This means that (inserting a unit U(A)U~+(A)) 
Ut (A)YU(A)U (A) YA, UA) = 9°", 
but now the first three terms form y° so what we really have is 
Chui any, == OOo. 


(Note that U is different from the time-evolution operator that we derive before!) The point is that an arbitary 


infinitesimal Lorentz transformation looks like 
AMY = gtY + wt? Na gP™ Ng” 


which tells us that we must have wk” = —w’#. A general transformation of a field @3(x) is then Usp@p(A7!x). Since 
we must have U(A)U(A’) = U(AN’) and in particular U(A)U(A~+) = —/, we must have U = /+7, where tT = wh” Myp. 
But if we write Muy = ay + DW, w being antisymmetric shows that we must have b = —a. If we plug in our 


ansatz and use the Clifford algebra wherever we can, we find that 


1 


1 
a=s My 3 [Yu Wl. 


Thus if we define oy, = yu yw], then for any infinitesimal transformation we have 
; 
U(A) =1— qu ouw: 
meaning that for finite Lorentz transformations we get 


U(Ay= eae, 


Example 24 


Let’s see what a rotation around the z-axis does to everything here. 
0) 
0 
0 
0 


U(a) = etl ow, 


Recall that a generation of the rotation is w = 


Oro Oo 
| 
an 


0 

0 = jBY: H 

: =wele ; in general, we then get 
0 


03 


In the Dirac basis, we have o!? = 4[y1, 77] = 
0 03 


| , so that our infinitesiaml transformation is 


u / o3 O 
T= $012 5 cu) : He 
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so that our field transforms as 
ee 
3¢ . 
yore 73) y, 


which is indeed (cos $ + isin g) w. In particular, we see that w flips to —w for @ = 27 and flips back to w for 6 = 47, 


which is exactly what we have in spin one-half. So we have indeed arrived at fermions! 


11 November 1, 2022 


Last time, we introduced the Dirac equation and Clifford algebra to set up a framework for describing spin one- 
half particles. We started with the Klein-Gordon equation and manipulated it to get a linear equation that describes 
relativistic wave mechanics, using the operator @ = y40,,. Those matrices y" need to satisfy a certain anticommutation 
relation {y“,y”} = 2n#”1, and then we saw that our wave fields transform infinitesimally as p 3 e740” ah under 
Cie = su ie. 

Reviewing some properties of our y matrices, we want to solve the equation (if — m)w = 0, expanding out the 
indices gives us the equation 


idow = ((—9°) 7190; + (my®)~*) , 


and plugging in a plane wave y = e~'* gives us the equation 
Ew = (9°) 1B F + my") *) ¥. 


Since we want E to be real, we saw that (y°)~t = 7° and that y°y'7y° = (7')?. Now we can get an explicit basis for 


the (4 x 4) matrices -y — we're going to use the specific choices 
= 
Pah hese, Cy = Ow 


(where s,v, 7 stand for “scalar,” “vector,” and “tensor”), as well as 


i 
4! 


2A3 


2 SS eae ee 


(calling this matrix y° is because it serves as a fifth basis vector, and P stands for “parity’). Additionally, we have 
ie = 57 u 


((here AV stands for “axial vector’). There are 1,4,6,1,4 matrices for the scalar, vector, tensor, parity, and axial 
vector basis elements (6 for tensor because we need them to be antisymmetric), and we can check they are all linearly 
independent so this gives us a way to write down any 4 x 4 matrix as a linear combination of these terms, and each 


of them will correspond to some physics. We'll write down a few more properties that we can check now on our own: 
ag = 
* {¥5,%u} =O for all p. 
> (¥5)' = 4s. 
> Tr[y5] = 0. 


! 0 
« In the Weyl representation, we have ys5 = ; | in block form. 
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But a Lagrangian is never a vector, so we need a way to form Lorentz scalars out of all of this, and we'll do so by 


creating bilinear forms with these different matrices in between. We now see the reason for our naming — we have 


oy = ory, 
which is a scalar under Lorentz transformations, 
wee = Wu, 
which Lorentz transforms as a vector, and 
Wi 


which Lorentz transforms by multiplying by [,,”. Then we also have 
wp = drs 
transforming as a pseudo-scalar (changing signs under parity inversion), and 


We py = bye WY 


YW 
2 Wr 
transforming as an axial vector. So now if we have any Ww = pal we can decompose wW into (so Wr is the 
3 L 
Wa 


upper two components and wz, is the lower two) and define the projectors 


re htt) =| i r= ga-w= | if 


Indeed they are projectors because Pe = Pp and Pe = P,, they are complete because Pe + P,; = 1, and they are 


orthogonal because Pr- P, = P, - Pr = 0. Additionally, they have commutation relations 


1 1 
a a) er Ce (0 led a 
by the anticommuting relation of 5 with yp. 


Remark 25. Al/ of these statements are basis-independent, but using the Weyl representation makes it easier to see 


how some of the computations work. 


Turning back to the Dirac equation, we see that 


Wid — m)b = O(Pr + PL)(iB — m)(Pr + Pr)w = WPRIBPLW + WPLIBPRp — mp(PRPr + PLPL)Y, 


and now if we define WPr = w, and WP, = Wp, so that Prw = Wer and Phy = yw, our Dirac Lagrangian becomes 
L= pridde t+ bribe — Vadim —bepem. 


So if the mass is zero, we have two copies of the same field here, and thus having two different particles sitting here 


gives us an enhanced degree of symmetry. We can next see the property 


wWWR=WR, WwW. =—-W, 
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so this is a chirality property — fermions with a definite x are called chiral fermions, and if we commute this operator 


with the time-evolution operator we find 
[H,-¥5] = Hy5 + iysy°y'0; — Ys¥°m = 27°y5m. 


So if the mass is zero, then chirality will be unchanged under time-evolution, and so for massless fermions this is a 
good quantum number. 

Keep in mind that everything here is still for the free Dirac equation. But nothing that we've done is quantum yet, 
so like for the classical field theory we'll want to quantize everything here. But first we should find classical solutions 


and study their properties. We start again with the Lagrangian 
L=pid—m)p 
and solve the Euler-Lagrange equations a — Oo. (=) = 0 and similar for w. We end up with 
—ym +d, (piy*) =0, (ii-m)p=0 


so this is the Dirac equation from the Dirac Lagrangian (as we expect). If we now define 


Wr2=us(p) =e, 3.4 = ve(p)e’™, 


we want to have 
(i — m)~i2=(~-m)fi2=0, (iB—- m)v34 = —(p+ m)b3,4 = 0 


where p = py,,. We can make our life easy by defining the spinors 


us(P) = (P+ m)Ax(P)uot, Ve(P) = —(P— m)Bz(p) voz, 
where Ax(p) and Bs(p) are some scalars and 
1 0 0 0 
0 1 0 0 
U4 = V2Mm Ug = V2m ‘i Vo. = V2m il: Vor = V2M 
0 ) ) 1 


Plugging into the Dirac equation and using some matrix algebra, we find that 
1 
(p + m)(p— m) = pp — m? = 5 php’ {yy} — m= php’, — m° = p* — m* = 0, 


assuming that we are on shell (meaning that we have a particle living on its mass shell). We can then also define 


ffl 


so that uj4 = V2m : — this will be useful later. 


Next, it turns out these spinors are properly normalized: we want to have 


U4(p)u4(p) = 2m, 
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a Lorentz scalar, where U = Uty°. We then have to define our constants Ay so that 


2m = Us(p)Us(p) = |Ax|?uh,. (p + m)*¥0(p + m) Uo 
= |Ax|?Tox(p + m)(p + m)uo+ 
= |Ax|?Tiox(p* + m? + 2pm) uo. 


Picking a particular basis, we have the block form 


ne 


—G-p —p°l 


where @ is the Pauli matrices, and plugging everything in yields 


2m = |Ax|?(2m) | (2m*) +2 


oOo oOo CO F 


(we only care about the first coordinate because of the dot product), which means 


1 
Ay = 7 
s/f 2m(p® + m) 
Similarly, we will want the normalization 
Vive = —-2m Bs = Ax. 


So collecting terms, we find that our spinors are of the form 


_ vam Graal a 
* Pam o| m+ p? 


and similarly 
1 


vy. = ——— 
~  VM+ Po 


Oj PiX= 
(p° + m)Xmp 


So we have explicit solutions to our Dirac equation now, and more generally 


U;Us = 2MOrs, VirVs = 2Mvs, UrVs = Vrus = 0, 


where r,s basically label +. If we define Ax = =n we find that 


Ayus =uUs, Ave =Vve, Agvie=0, Aug =O. 


Thus we find that Ay +A_ = 1, and we have 


2m, = 5D Us(p)Us(p) = prm 


SH 


and similarly 


2mA_ =|— vs(p)Vs(p) = —p +m). 


sear 
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(p? + m)x+ 
—O;PiX+ 


These can be directly computed, or we can use the rules above to see that (> usUs)u, = 2mu, and note that A; 


should be the identity on u+. 


Remark 26. /t will turn out that A, projects out a particle with spin +3, and A_ projects out an antiparticle with 


spin #5, but we'll see this more concretely later. 


So we want to use these classical solutions ux, vt for quantization later, and the boxed equations are really what's 


important in all of these derivations. We'll now move to talking about discrete symmetries of the Dirac Lagrangian: 


+ Parity: we know that A#, = diag(1,—1, —1, —1) is a valid transformation, and we have 
Uty#U = My’, U= el? => Ut = 1% #. 
(So under parity, w goes to y° e/a.) 
+ Time reversal: we can check that a Lorentz transformation diag(—1, 1, 1,1) corresponds to U = i713. 


+ Charge conjugation: this is useful for quantum electrodynamics, in which we want invariance between particles 


and antiparticles. Here if we have 
Laep = —epA"ynw = —evAy, 


and we do the charge conjugation transformation 


Vid — eA— mb > $ (iB + A— mde, 


then we want to figure out how the fields should correspondingly transform. Taking complex conjugates flips the 


sign of the i@ term, so the transformation gives us 
(psit)*(9°)*(—i8* — eA* — mp", 
and now if we introduce yw. = Cw* for some invertible matrix C7~1C = 1, this expression is the same as 
(CCH) "9° (—i("*)* yp — CAS = mC PCH = WLC YN" (=i(y#)* By — IA = m) CMe. 


Now matching terms with our ansatz, we find that (C~!)'C-! = 1 so Ct = C7}, and C(y#)*C71 = —4. 
Furthermore, the imaginary part of y°, y!,-y? should be zero, {C,y°} = {C,y1} = {C,y?} = 0, and then we 
should have 73 = —Yo = > [C,y2] =0 and a = —1. So flipping the charge of the electron is a symmetry: we 


have 
C= in => be = Io" 
in this particular representation that we've chosen. 


Next time, we'll finish up discussing properties of the Dirac Lagrangian and move on to electrodynamics! 


12 November 3, 2022 


Last time, we analyzed the Dirac Lagrangian L = W(i#—m)w and found some classical solutions to the Dirac equation 


Vi 


1 0 
of motion: letting x, = A and x_ = a (here we're looking at the spinor space, since w is of the form ), 
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we wrote down plane wave solutions 
y = u,(p)e'P* fs u_(p)e'P* fh v4.e!Px &. v_elPx 


where p? = m? and in the Dirac representation we have 


eee ek (p° + m)x-+ a: & + PXx 
~  V/pli+m | -F-pxe || Vp? +m |(p) + m)xz}- 


In particular, we get unit vectors in the spinor space if we take p > O and p® + m. We then looked at symmetry 
transformations (parity, time reversal, and so on) and tried to figure out how the spinors change. Specifically, charge 
conjugation symmetry pty We = iyoy*, the Lagrangian stays constant and we have —ewAy sent to ewAw. So now 


if we look at the ux and u_ terms of our plane wave solution, we have under charge conjugation that 


. 1 0 ios} |\(p?+m)xa]} 
Wou = Yor, a ; ee el 
Vpi+m|—io. 0 = * PKs: 
(recall that the first matrix above is /¥2 in block form), and now we can perform the multiplication: because idox+ = X= 


by definition of the Pauli matrix, and joo(@- p) = —G- px=, 


o - - j 
oi = V+ el Px = Wy 
(p° + m)x+ 


1 
Vp? +m 


so something we associate with particles ends up being associated with antiparticles. So this is explaining why when 


Wevu = 


we change the sign of the charge (dealing with positrons instead of electrons), everything else will be the same. 


We'll now introduce certain linear combinations of spinors 
by = uel — viel, py = ue + vel, 


where Wi.¢ = Wi and Wo.- = Wo, so these particles are their own antiparticles (we call them Mayorama particles) — 


for example, neutrinos could satisfy these properties, but we presently don’t know whether they are. 


Example 27 
One ongoing idea for checking whether this property holds for neutrinos is to observe the neutrinoless double 


(-decay process (OvGG). Consider the particle decay 
N+>P+e 4+2D.¢. 


(In particular, the maximum energy of the electron would be Emax(e~ ) = My — Mp — my, and repeating this can 


give us an estimate of the neutrino mass. This is ongoing work — we know that my, < 0.7 eV, and the best-fit 


value is around 0.26 + 0.34 eV.) This process has to do with Mayorama fermions, because (when doing this decay 


process) we could imagine having two neutrons coming in, both undergo beta decay, but with a neutrino absorbed 
from one decay vertex to the other if neutrinos and antineutrinos are the same. So such a coincident decay could 


be observed, and that would imply that the neutrino must indeed be a Mayorama particle. 


There's one more transformation that we care about, called helicity. Recall that we previously introduced 
_ iggy i 
ya eat Surdy, Opvy = 5 W) 


and we saw that we have spin one-half by considering the case 012. We can now write down a more general spin 
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matrix 


=e (77, 7], fy. Ya). [12, Y2]) = : | 


and we define the helicity to be the projection of this matrix in the direction of our momentum: 


Helicity is then a valid quantum number, because recalling that we have the equation of motion 


idow = Hw = (—i7°7'0; +-¥°m) yo, 


m oO-p ; —o-p m 
we can write the right-hand side as | _— =: | W in the Dirac representation or 7 ] w in the Weyl 
o-p —im —m 6-p 
representation, and then we have [h(p), H] = 0. So we can find some states of definite helicity Wy, but if we want to 
0 
stick with the Dirac representation, we can imagine we have a particle flying in the z-direction 6 = | 0 |. We then 
Pz 
have 
1 03 0 1 
h =—_ => +h =i 
(ez) 210 a (€z)X+ aXe 
meaning that h(e,)us = +5ux(p) and thus a +1/2 state (a right-handed particle) if the spin and direction of motion 
are aligned and —1/2 (a left-handed particle) if they are antialigned. We'll use X for helicity. We can similarly find 
that for the left-moving plane wave, h(e,)vi = F5vz(p), and we have A = 4 if the spin and particle are aligned 
again (but this time we have a right-handed antiparticle) and \ = —$ if we have opposite alignment (a left-handed 


antiparticle). But we don’t have Lorentz invariance for the helicity — for a particle with mass, we can imagine 
boosting into a frame faster than the momentum p, which would flip the sign. However, it’s still useful to measure in 
a particular frame in the lab (for example polarization), and it’s still a valid quantum number. 

So in summary, Lorentz invariant combinations of spinors take the form wlaw (where the Is live in the space of 
16 matrices that transform in various ways that we've previously discussed), we have four fields in w, corresponding 
to particles and antiparticles with helicity-spin in or opposite the direction of motion. We're now ready to quantize 
our fields here, and the anticommutation turns out be a little weird. We want to think about observables, so we must 
first figure out what quantities are actually observable — the issue is that we don't know what to do with the indices 


of w. So observables are really given by local observables at a single spacetime point 
O(X, t) = W(X, Hr av(®, t), 


for which we know how these transform (some as scalars, some as vectors, etc. under Lorentz transformations) and 


all of the w indices are now contracted. We then see that we want causality, so we require 
[O(X, t), OY, t)] = 0 if (xy)? <0 => M)1Y(x), By) Fov(y)] = 0 


for spacelike separation. One such observable is the momentum p", which is the conserved quantity associated with 


the Noether current of spatial translation: 


OL 
—— d Od, — NL 
p= | Praag yorte— the 


= / dex [ipyOXw — no (Pidp — pmy)] . 
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We do see that everything written here is in terms of “bilinear forms” of 7, and now we want to know how to work 


with this. Much like our procedure for the scalar field, we postulate that we have a mode expansion 


d Pp 1 —ipx 1px 
Baye Fast) Do, etal + bivalve?" 


a} 


W(x) = 


where because we have a complex field we have two sets of creation and annihilation operators, a and b. We then 


have 


a Oe aa [alu.(p)e’™ + bs Vs(p)e~ aad 


a) 


V(X) = 


where remember that Ui; = uly°. We now want to interpret a!(p) as creating a spin s (s is one of +1/2) fermion field 


/ particle and similar for bl, so we want the commutation relations 
[p“, as(p)] = —p*as(p), [p", bs(p)] =—p"bs(p), [p*, al(B)] = p*al(p), [p*, b1()] = p*bt(p). 


We can then see that (combining these requirements with the definition of w and w) that fermion particles and 


antiparticles must satisfy the anticommutation relations 
{ar(p), a}(q)} = (217)°6-s6° (B— G), — {br(p), bL(a)} = (2m)*5s6 (B— 4) 


and all other commutation relations {a!(p), a!(q)}, {a-(p), as(q)} equal to zero (and all as and bs also anticommute). 


We then also see that we must have 


(W(X). 07. th} =PIOR-Y), (WR, VS. OF = (OR 1), OV, t)} = 0. 


Example 28 


We'll see one example of this in action to demonstrate the calculations involved. 


Plugging in the mode expansions and looking at the t = 0 timeslice to make things easier, we have 


dp dq 


(2m)? (2m)? \/2u(p) =o. 


{Y(%, 0), W(X, 0)} = ee Per! y_(BYVe(q){ IB, bs(4)} 


BR 


ma) 


+e" PX elu, (p)T(G){a- (8), nan 
(here the other two terms instantly vanished because the anticommutators were zero). Integrating over gq, this simplifies 


to 


= | om On ey [eM vars + eM (ust) 


11 


2 


and now we must remember (from the explicit representations of our Dirac spinors) that >> Vs(P)Vs5(p) = pom 


s=i 


Nis 


and daa us(p)Us(p) = ~— m, so this simplifies further to 


/ Bp ol [ep — m) + eP®N(y + m)) 
(27)? 2w(p) 
and if we substitute f + —f, turning p into y°p° + y'p’, this becomes 


d?p 
(21) 


isp . te dp pares 2 = 
eo iP(®-9) [7° p° — vip; — m+-7°p? +-y/p! + m] = mele DR-Y) — y053)( xz — 7), 
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as desired. So the key difference is that we'll have to start doing spin sums and take combinations of our slash operators 
so that they cancel out, but in the end it’s very similar to what we did with scalar fields. For observables, this means 
that 

PRM V(X VY ov) 


and use that [AB, CD] = A{B, C}D — AC{B, D} — C{A, D}B + {C, A}DB to write this as 
= B(%) (PryPo — F271) Y(2)5O(% — J), 


so equal-time commutation relations are zero whenever we live not on the same spacetime curve, and we 
can always find a Lorentz boost back to this whenever we have spacelike separation, so we do have the desired 
zero commutators. That means that p(x) = [ bs at eee [e-'*as(p)us(p) + e!Xbl(p)vs(p)| annihilates 


Vee) 
particles of spin s and momentum / at x and creates anti-particles with that spin and momentum. 


13. November 10, 2022 


In the last few lectures, we've thought about how to study fermions in quantum field theory. We start with classical 
(Dirac) theory and wrote down a Lagrangian, giving us the Dirac equation of motion which can be written in terms of 
certain y matrices (acting on spinors w) that satisfy the Clifford algebra. 

Last time, we studied quantum observables in this formalism and wanted commutation relations to make sense 
locally (in particular having causality, so that we have commutation relation). We also wrote down mode expansions 
for our solutions in terms of creation and annihilation operators a, and bl, seeing that we get the expected complex 
scalar field commutation relations. 

We'll now finish the quantum mechanical discussion of fermions and see what their properties look like — the thing 
that’s still missing at that point is a discussion of photons. We'll start by thinking about the spectrum, which we've 
already hinted at: we want to create momentum eigenstates, where we look at the momentum operator P¥ associated 
with the Noether current. We find that 


[P*, al(p)] |0) = P*al |0) = pal |0) 
(that is, we have an eigenvector of P#), so we'll define the electron state 
|e“ (p. s)) = V/2Epal(p) |0) 


and the positron state 
|e*(p, s)) = \/2E,b!(p) |0) . 


We then have the correct normalization 


as(p), al,(p’) 0) a (2 )32E,6° (jp — P')ds,5 


(e(p,s)|e (p's) = (0 a.(p)al,(p’) 0) (2B Ibs = 4/2E,28e (0 


where we've used the anticommutation relations for aps. And now if we create two particles, we see that we have the 
antisymmetry 
al (p1)al(p2) |0) = —al(p2)al (pr) |0) 


because the anticommutator is zero, so |e (pi, r), @ (P2,5)) = —|e (po, 5), e (p1,1r)) and we have Dirac particles, 


specifically fermions (antisymmetry of the wavefunction under exchange). If we now consider the generic one-particle 
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wavefunction 


n= f 28, Cay De, HPN0)10 


an 


ol 
(which could be a wavepacket or a state at a particular frequency) and we consider “two of them in a single wavefunc- 


tion,” we have 
dp dq 


a= (mn)? (On) » f.(p)f-(4)al(p)ar(p) |0) 


and applying antisymmetry, we also see that 


an) =— f 28, £4 kiwyntapal(o)ai(o) lo) 
(27)3 (21)3 oe Ss p r q r p Ss p 
and then swapping the roles of p and q gives us back the original expression, so |2f) = —|2f) and thus |2f) = 0 - 


this is the Pauli exclusion principle. 


Remark 29. Remember that the whole reason we have anticommutation relations for fermions is that we required 
[D(x)Fiv(x), b(V)loW(V)] must be some number times 6°) (X — Y) if we want causality. And commutation relations 


for such observables requires anticommutation of fields. 


We'll now turn to the conserved quantities in Dirac theory — we'll follow a similar path as what we've done before. 
We have 


L=Wid-m)yp => aut = a, ( we 60) — o£. 


O(Onby 


Consider an infinitesimal translation by a4, so that we have 
p= od, 60 = PO SS 6LaS PO jl), 


and for translation invariance, we see that 
MY — AD ijHOv a. 


Plugging in our mode expansion, we then find that going into normal ordered form, 


Pe -/[s ooo dat a; — bs bt) = = [é (On ae So (alas ats bib, _ (21)°5°) (0) 55.5) 


but much like before this rightmost infinity term is a vacuum energy which we can ignore (since we only care about 


energy differences). Similarly, if we consider multiplication by a phase w > eV and w — e7'7y, then we have a 


Q = f xj" = af oe So (alas — bibs) 
(2n)3 - sas sYs), 


and notice that this is a count of particles minus antiparticles. So if we take q = —e, we see that this gives us charge 


current j4 = qyy"4a, so that 


conservation from symmetry of the Dirac Lagrangian. 

We can list a few useful facts: the Hamiltonian H = p°® is positive definite, the charge operator Q is indefinite, 
[Q, P#] = 0, w is the field operator for fermions, and momentum eigenstates are |p, s) = V2Eal(p) |0) for particles 
and |p, s) = V2Eb!(p) |0) for antiparticles (the p is just notation for having an antiparticle). So now we can solve our 


free Dirac theory in the same way as before: we have non-equal-time anticommutation relations 
dp 1 dq 
(2m )* 4/2Ep (25)"4/ 26g 


(here we've only kept the nonzero anticommutators); plugging in our anticommutation relations gives us a delta 


{(x), (y)} -|/ mule iP!" v,.7(q) {bi (p), bs(q)} + e'*4'™ uy (p)Us(q){ar(p), ab(q)}] 
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function that cancels out one of the integrals and one of the spin sums, simplifying to 


_ dp 1 
(21)? 2E, 


So [vevse™ Y) 4 @WiPlx Y) ust]. 
Ss 


But noting that }7,VsVs = 6 — mand >), UsUs = p + m (from exercises), plugging back in and writing in terms of 
derivatives yields 
{b(x), b(y)} = / PPh ig m)(er PON) — elev) 
(2n)3 2E, 


and now taking the differential operator out of the integration, this is 


= (if + m)(D(x — y) — D(y— x)), 


so the two-point correlation function should look familiar to us. We can now define the Feynman propagator (vacuum 


expectation of the time-ordered product of two scalar fields) in exactly the same fashion: since we had 
Dr(x — y) = (O|T{O(x), (vy) {010) = O(x° — y°)D(x — y) + O(y? — x°) Dy — x), 


we now have something similar for fermions: the Feynman propagator for fermions is 


Se(x — y) = (i + m)De(x - y) = Sop erin) Em 


We then find that 


d*p e7/P(x-y) i(p + m)(p _ m) 


= j§V(y—y).] 
(Qr)4 p?— 1m? + i0 We =a) tae 


(18 — m)SF(x —y) = 


So we've solved the Dirac theory as long as there is no coupling — the solutions to the vacuum two-point correlation 
function give us the entire theory, and we found the time-ordered product and generic two-point function for fermions. 
So this is basically all we could do for the free Dirac theory, and we've found charge conservation, antisymmetric 
wavefunctions, and the Pauli principle. 


We'll now move on to photons, starting with a review: we study the electromagnetic field 
E; = —0)6- OAi, Bi = €1jx0/A*, 
and where we can frame electromagnetism in terms of a field tensor 
FRY = QR AY — QV AP 


0 -E£, -E> —E3 
Fi 0 —-B3; Bo 
—& B; O —-B 
EF; -Bo -Bi 0 


and writing ¢ = A®° and FHY = , we have the Lagrangian £ = =a PVE iy. The equations 


of motion are then 
O,FeY =0 = > 0"0,A’ — 0” (0,A*) = 0, 


—OE' 
—OoF, + €ijxB ~s = F oe is 
where this is basically telling us that oa anh (so we need V- E = 0 and oe = V x B). One important 
—O, Eo + E;jx Br 
—O9E3 + Eijk Br 


property of electromagnetism is that we have a gauge transformation under which Maxwell’s equations stay the same: 
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replacing the vector potential A; with A; — 0;A and replacing @ with + OoA (in other words, writing AY + A# + d#A), 
and the equations of motion stay the same so the physics should do so as well. This can be seen easily by looking at 


the Lagrangian: we have (here note 0? = 0,0", so 6° refers to applying two derivatives, not the 2 component) 


0° AY — A” (OA) — 07AY + 0”? — OY (BA) — 0"(87A) 


and the second and fourth term cancel out. Since the physics is not changed, we can therefore choose a convenient A 
— there are a few gauge choices that work well. For example, we can use the Lorentz gauge 0,,A" — 0,,A* + A= 
0,A, so that 07A = —0,A" and therefore 0,A’“ = 0. On the other hand, there is the Coulomb gauge where 
OorA = —¢, giving us a shifted potential ¢’ = 0 and where 0;A; = 0. So we should count degrees of freedom in our 
A“s — a classical photon has two degrees of freedom (polarization states), and if we want A” to be a field that we 
put in our Lagrangian and eventually quantize, we need to account for the fact that A” has four degrees of freedom. 
That's what our symmetries do — the Lorentz gauge is actually a family of gauges because A” still has three degrees 
of freedom in that case, and we'll fix that other degree of freedom soon. But the point is that the two remaining 
degrees of freedom will be the correct ones. 

Photon quantization is easier than fermion quantization because we have a vector of four scalar fields: we know 


that A¥ transforms as A“,A’ under Lorentz transformations, so 
[A#, 4] = igh” (x — y). 


So we should do something completely analogous as for scalar particles, but we run into a bit of a snag: the Oth 
component of I in this case is 1° = GTI = 0, so we can’t have the desired commutation relations. So we're not 


going to fix the field values from the start, and instead we'll modify our Lagrangian slightly, setting 
1 
L= a ow — 5 (OuAM) 


and use Lagrange multipliers. We find that we must have 07A# — (1 — A)O“(0,A”) = 0; setting \ = 1 gives us the 
Feynman gauge, » = oo yields the Landau gauge, and » = 0 gives us the unitary gauge. (We'll mostly be using 


the Feynman gauge.) This then enables us to write down 


OL 


ne = 
O( OA.) 


SPL =O AP =I, Ag == PP S=1a4. 
So if the Lorentz gauge is not manifest at the operator level, we'll instead require that on whatever physical set of 
states we define it on, we have the expectation (Wphys|O,A”|Wphys) = 0. (This is called the Gupta-Bleuler quantization 


— it may look like a crutch, but we'll see how it comes up usefully for quantization procedure. ) 


14 November 15, 2022 


Last time, we started trying to frame electromagnetism in terms of quantum field theory. We know that solving the 
equations of motion for the Lagrangian £L = — 5 Fy FY leads us to quantizing the vector potential A* = (¢, A’), and 
we find that the usual canonical momentum 1” = 0#A° — 8° AH leads us to M° = 0, which is bad. So instead, we need 
to appeal to gauge freedom and consider the local symmetry A), = A, + O.A. We chose the Lorentz gauge (which 
is relativistically invariant) in which we can guarantee 0,,A’ = 0 (classically this is adding a gauge-fixing Lagrangian 
—4£(O)A")* and then using Lagrange multipliers), and then we find that M# = 04A° — 6°A# — én°#O°A°. But on the 
quantum mechanical level we'll do a different modification, splitting our set of total space of states into a physical 


and a nonphysical part, where the physical states are those where we have (Wphys|OuA"|Wpnys) = 0. (The point is that 
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we need to include this extra symmetry in our theory somehow, and in future lectures we'll see how to quantize field 
theories in the presence of local symmetries without needing this kind of crutch.) 


From here on, we'll set € = 1 (this is known as the Feynman gauge), so that we have 
1 LLY 1 By\2 1 LL AV LL AV L Vv 
L= — 7 Fw F — 5 (OLA y= =5 [O,ALO“ A’ — OLA, OYA’ — 0, AY OLA’ , 
and now adding total derivative terms to make these terms look more similar leads us to 
1 1 1 
= 5fu (O,0% A” — 0,0" AP + OY O,A*) = 5 AvO,Or AY — —5 (pA )(OAL), 


so that the associated momentum is an 


= = 0 A" 
O(OA,) 


Doing a mode expansion, we can write 


At = 


Ye ret (p)a(p) + eet (pat) (p) 


me = 


where the es (we can put the (A) label in the top or bottom, with no difference) are basis vectors in the directions 


0,1, 2,3 — these are called polarization vectors and we require that 


0) _ ,0 (3) _ 0 (1) — pe. (2) 
pte =p? pt-eS) = p®, pte) = pte =0. 


Specifically, e is called the scalar polarization, e) is the longitudinal polarization, and ef 2) are the transverse 
k 

polarizations. So if we have a massless particle k¥ = ol we can parameterize es to be the standard basis vectors 
k 


in the first, second, third, and fourth coordinates. We'll also normalize to avoid dependence on momentum, so that 


E(y) (PJE(W),u(P) = Mrw- 


(We should think about having one field per polarization, so we have a “separate photon” in each of those directions 


because they're independent particles.) We claim we have the usual commutation relation 


[AY (x), MV] = igh’5 (x — y) | 


Indeed, plugging in our mode expansion, 


3 3 ‘ ’ 
in. an =~ f ba ae Pane ag Be [te De 10%). (0 


+e YE (pe (a)(ia?)lal™ (p), a] 


and we see that this means we require 


[a (p), ah (a)] =~)? SO(G- B) 
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so that the commutator above becomes 


dp : 
~ SL mp ian ys ” Welty (P)ER) (P)P° le eiPR-Y) 4. giPE-¥) 


which is indeed ig4”5()(x — Y) as we wish because }>) y, = eb) (PEK (P) = —g'”. (Here remember 7 and g are 
the same.) 


We can now write down the Hamiltonian by finding the Noether current: we have 


OL 
jy Vv yh 
/ HOAs)” eames 


1 
= —d!muA,0!nuAs + 5 (QuA”)(O" Ac), 


1 1 
H= oun =) -5n, = SOAPOIA 


Plugging in mode expansions and doing some lengthy calculations, we find that 


so that 


Lf &p 
"=3 } Oma” - —mx [a (p)at®) (p) + at (p)a(p)] 
A, rA’=0 


and as usual we can rewrite this in normal ordering: 


dp 


oma? pei Ay) (a) — Alo) (0) + vacuum energy 


where we ignore the last term. But this should be concerning, because now we are enumerating the modes of 
polarization zero and those can contribute negative energy. But this is where the Gupta-Blueler condition comes into 
play: if we want 

0= (Wpohys|O.A" | phys) ' 
then plugging in our mode expansion shows that we are requiring (here we are using k instead of p for momentum) 


3 
= 2 kHel (k)aca)(k) |Wphys) . 


A=0 


But we already imposed the condition on polarization earlier that e412) (k)k, = 0, and ek, = k® and e#@)(k)k, = 


—k°. Thus we are really requiring that 


k°(a)(k) = a(?)(k)) |Wphys) =0 => (Wonys| aa [abonys) i (Vonys|a"a® bohys) . 


0 TA) Yap 
oar ys vn) 


and the physical state of observables now has well-defined time-evolution and positive Hamiltonian — the other part of 


So plugging this in we see that 


(Wphys|:H:|Wphys) = (i 


the state space is just coming from the additional symmetry and the way that we set up the problem. 
We can now write down our Fock space by writing down particle states (this means we have a particle with 


momentum k and polarization \) 


\(k, A)) = —\/2E, at (k) |0) 
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and also writing down two-particle states 


(Ar. A), (ko, A)) = /2E kg V/ 2E gg al (ky al? (kg) |0) . 


For k, # ko we can flip their roles and interchange the particles, so our states are bosons. We can also create twice 
the same particle ‘ 
a ZEW (aM (kK)? |0) = |2(k, A)) 


and there is no Pauli exclusion principle (we can put in the same particle multiple times); generalizing, we get n-particle 


states satisfying the equations 


VJ 2E x, at” (ki) |m (kr, Ar), 07+ (ki, Ai) = Ve FD | (kn, An) (i FAK AD), 
1 
ya?) [1 (k1, Ar), +++ (ki, Xi)) = Vi (kn, Ar), ++. (i — 1) (Ki Ai), +) 
We should think of annihilation as absorption (reducing the number of photons by one gives us proportionality to \/n), 
so the absorption probability is zero if there are no photons. On the other hand, creation is emission and is proportional 
to ¥1+ n (stimulated emission), so we can emit photons even if n = 0 (this is called spontaneous emission). So 
in a cavity, having more and more photons makes it easier to emit more photons and that’s how lasers work. 
So now to solve our theory, we can commute the non-equal-time commutator 
dk 1 
(27)3 2k? 


Au) AY] = — Suu ee eh ea k=y) DY s), 


so that the Feynman propagator Is 


an e ik(x-y) ae lal 


DE(x— y) = OITEAMX)AHIO) = f Gaya Ba" 


which is the Green's function of Maxwell's theory: 


D2’ (x — y) = igh¥54 (x = y). 


So we can calculate the time-ordered expectation value just like for scalar field theory, with the only complication 
being sums over polarizations. Then the Feynman propagator is basically the same as the massless scalar Feynman 
propagator times —g"”. (And we're assuming that photons are massless, and gauge theory looks naively wildly violated 
if we have nonzero mass, so we won't get into that here.) 

So summarizing everything, we see that we have the Dirac Lagrangian #(i@ — m)w and the electromagnetism 
Lagrangian —$F*"F,, — $(0,A“)*, and now we will just add an interacting Lagrangian —epy"pA, = —epAw 
(where e = V47rq@ and a is the fine structure constant). The overall quantum electrodynamics Lagrangian is then 
the sum of these three terms. 

Writing down this particular interaction term is not super motivated at first, but we'll see that it has the properties 
that we want. We can then calculate the S-matrix elements: 


co 


S =U(T,-T)= ie)" / dx, de xnP(x1) ACA) WO) =: Bxn) Amn) 0%). 


n=0 — 


We wish to calculate the overlap S¢; = (f|S|/), which we still claim (but haven't proven) is 9 (f[|U(T, —T)|/)g where 
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we only sum over connected, amputated diagrams but compute in the free theory states. We write down 


o- on aR, oe 


1/2 


for creating a positron at x, and similarly wy creates an electron, w~ absorbs an electron, and w absorbs a positron. 
We also have that At and A~ create and absorb photons. We can represent these all in Feynman diagrams, with 
positrons and electrons being emitted or absorbed based on outgoing or incoming vertices, and photons represented 
with wavy lines. Wick’s theorem tells us that we can contract W(x)Py = S-(x — y), and we similarly see here that 
A¥(x)A’(y) = DE’ (x — y). So in the Feynman diagram formulation, we claim that —iepAw corresponds to a vertex 


with a photon and an incoming and outgoing arrow. Then the first-order S-matrix is 
Se = “ie [ axT BAW) = ie [ ato Axu(e): a ae 


The normal-ordered product basically means that we can “direct time” in a variety of ways at vertices — we have 


three-point interactions such as a photon creating a positron and an electron. But 
(ep) =p, 0 = 2m? + 2pe-Pet = 2m? + 2Ee-2Ee+ — 2|Pe-||Pe+| cos 0, 


but that cannot happen and thus there is no three-particle scattering at this first order. So we have to expand the 


S-matrix some more to get the interactions to show up, and we'll see this more next time. 


15 November 17, 2022 


We introduced quantum electrodynamics last time, writing down the QED Lagrangian 
—): 1 1 = 
Leaepd = Wis _ myp — qiwre _ 5 (OuAty? ~~ epAy, 


where the first term corresponds to the free Dirac equation (fermions — electrons and positrons), the next two describe 
free electromagnetism and photons, and the last term describes interactions between them. We found that the 


Feynman propagators look like (these are contractions os wy and A¥A”, respectively): 


dtp | i(p +m) d*p _, —iqh’ 
—y\)= | —eipe-y)_ AF perry ey) = —ipoxy)_ 
Sr(x—y) lope Pi apeae oR love pe +i0° 


We want to do calculations in perturbation theory, since we can relate the electric charge e to the fine structure 


constant and find that e is quite small. So we can do perturbation theory to study scattering processes; using all the 
tools we found that the overlap of the S-matrix S-; can be calculated in terms of the the time-ordered product of 
the interacting Lagrangians, integrating over n points at nth order. But then we found that there's no three-particle 


on-shell scattering (y = et + e~) by kinematic momentum calculations at first order. 


Example 30 
We'll study Compton scattering today, which is the idea of shining a photon on an electron at rest and seeing 


what happens. We'll generalize this to a process of the form y +e — y+e, where the initial and final 


momentum and polarization of the photon are p;, A + p3, X’, and the initial and final momentum and spin of the 


electron are p3,5 — Po, 5’. 
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Our initial state is thus 
li) = |e“ (po, s)9(p1. A) = V/2E12E 2a}, (p1)al (pe) 10) , 


and similarly we have 
If) = V2EsEaal,y(p3)al(pa) |0) . 
We thus want to find S¢; = 9 (f|S|f)g to help calculate scattering amplitude, and we care about the nontrivial part 


where something actually happened to the electron and photon. It'll thus look like d;¢ + i(27)6“) (p; — pp)M for some 


constant M which we want to find, and the first time we will actually see a nontrivial term like that is if we expand to 
second order (first order vanishes as we saw last time). So we want to calculate the term 


1 
5 (ie) V16E1 ExEsEs (0 0) ; 


We're only getting something nonzero when we contract all fields together, and we can only contract operators of 


2(oy(a)acay(ba) f abeayT {pyAxbxPy yWy ls) (P2)4{y) (01) 


the same species of particle. For example, a(./)(P4) could be contracted with al (po), and a,)(p3) could be contracted 
with Alyy (Pt), and then we can contract AxAy, WW, and py wy. But that’s really just free propagation of the photon 
going to itself and the electron going to itself, so it shouldn't be included (the diagram is not connected, and we just 
add this to the overall vacuum energy). 

Next, we can contract a(.)(p3) with , and wx with al.) (P2): Wy with yy, and then connect a(,)(p3) with Alyy (P1) 
and contract the fields A Ay (So at x, we create a photon, which connects to a spacetime point y where a fermion 
line is connected to itself.) But this falls into the “amputated” consideration — the propagator is coming from outside, 
and again we don’t want to count this case. 

So now we can turn to something more nontrivial — we're learning that whatever we have on the outside should 
be contracted with something we have in our S-matrix. So we take our fermion ag/(p4) and contract it with w,, and 
we take our photon ar,)(p3) and contract it with A,. We can contract a, and py in the middle, and finally contract 
Ay with aly) (P1) and wy with al, (P2)- Similarly we can do the same but have ai,)(p3) Connect to Ay instead of A, 
and have ly) (01) connect to Ay. (In fact there are two of each instance giving the same Feynman diagram, which 
compensates for the 5 factor in front — this is down to being able to exchange x and y without changing anything 
else.) We could then do a lot of calculations, but it’s a fairly tedious process, and it’s easier to do these calculations 


with Feynman rules: 


« Represent electrons going in with arrows in the upper right direction, corresponding to a u,(p) spinor term, and 
represent positrons going in in the lower left direction, corresponding to a Vs(p) term. Similarly an electron going 


out is a U;(p) term and a positron going out is a vs(p) term. 


- A photon coming in is a €,(p) term (corresponding to a wavy line entering a node on the right), and a photon 


going out is a €7,(p) term. 


+ Propagators of fermions correspond to em terms, and propagators of photons correspond to a. 


- At every vertex we have a —/ey“6,,. term corresponding to a three-particle interaction. 


¢ Momentum conservation must hold at every vertex. 


Closed loops correspond to a f d*p integral. 


* We get a (—1) factor for each closed fermion loop WyWiWov2 = (—1)tr(WiWy Woo) = (—1)tr(Se(m — 
X2)Sr (x2 — X1)). 


+ Orientation of arrows matters, and they must run in a consistent direction around loops. 
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So in Compton scattering, we have y and e~ coming in and y and e~ coming out, and there are basically two 


potential diagrams to consider: 


P41 P3 P1 P3 
D, D> 


P2 Pa p2 P4 


There are no negative signs or other issues here, so we can just use our graphical Feynman rules. We always start 
with a fermion line and look at an outgoing arrow to write down what we get from each: for the first diagram D we 
get (here we require p3 + P4 = P34 to be the momentum at the right vertex, and p; + po = P34 by the conservation 
at the left vertex) 


= E (Poa + m) : * 
= Us'(pa) - (—ie)y" - BE, = me p i9 6 (ONY Us Pa) E(\n (P3 EQ) (1): 


(So we've “gone off-shell” in the middle of the diagram so that momentum is conserved, but (p3+p4)* 4 0 in general.) 
Similarly, in the second diagram D> we get 


ipa +m 


2, — m+ i0 


Tg: (pa)( iatiee (—ie)y"us(P2 ex" (Ps )eX(P1)- 


So this process of writing down all diagrams and using the Feynman rules gets us the total scattering amplitude more 
easily. (This is really the Born approximation, since we’re only looking at tree-level diagrams.) And this tells us that 


— 1 11 d*p3 d*pa 
2/(2p1p2)? — 4m2m3 Ns, Ns. J (21)* (2m)4 


MEDi Ds 


215, (p3 — m?)2m6,(pq — m?)(2m)*6 (p, + po — ps — pa)|MI?, 


where we'll discuss what the bar over M means soon. (We derived this for scalar scattering early on, and it’s the 
same idea here.) Here note that m, = 0, = me, m3 = 0,14 = Me, SO we really need to have pe = 0, and 
Nz, Ns, correspond to the number of spin states for particles of the two types (1 for scalar, 2 for photons because 
of polarization, and 2 for fermions because of spin-up and spin-down — we'd have something bigger if we had higher 
spins.) And the point is that M is the matrix element in scattering over all final states, but we also want to sum over 
all initial polarizations, but then we have to average out (divide by the total) because we just have a single state 
when we start. So we should take the average of our scattering probabilities if it (for example) doesn't matter what 


the initial polarization actually is, which is why we have the factors of Nz, and N;,. This means 


[M? = S00 IMI, 


A,r’ s,s! 
where |M|? = DD} + D,D} + D{D2 + DD} and 
ie? — A.M Vv * tT tt tO sé 
Dy, = eae uay (pro + m)y UZE WE, —> D; = Se Hee) (pi2 + m)y Y UsE Ev. 


But now remember that °° = 1, we can insert 7° into various places and use that y4t = y°y#y° to write 


2 
fer S. 
Drs Zo ae 2°" (pi2 + my uge ev. 
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So we can calculate the first term in the matrix element |M|? now: 
ef 


S- S- Dy Di = G- mp a Eu(P3 Ep (Ps EL (Pi )Eo(P1) 
dX 


A,r’ s,s! 
S- Ty,s¥" (p12 as myY" UrsUr.s° (p12 + M)Yptyp,s')- 
s,s! 


We can make use of the identities now that 
Dek (erex(rr) = —9M, Slur sins = potm, D)uy.sTy.s = pa tm, 
aN Ss s! 


so we can contract Lorentz indices between yy matrices and end up with 


= (s ape [(Pa m)y" (p12 ae my)y" (po a m)yv(pr2 tb m)Yu] , 


One of the things we can do for simplification here is to set all masses to zero to make our life easier (for example 
high-energy scattering), so that we want to calculate Str [Pay B12" po Piz"). Then y4y, = 4/ and y“pyy, = —2p, 


and remembering that S = (pi + po)* = pep is the center-of-mass energy, we have 
d* e4 
D,D! = <2 Atrlpaps papi] = — 5 8su, 
where u = (p; + pa)? = pz,. We then find similarly that 


4 
DD} = DtdD, =0, D,Di = —<8su, 


j2 4 2 2 2 z 7 < ‘ . 
so that |M|- = —Z(s* + ur). Plugging this back in and integrating out all the delta functions, we find that the 
scattering cross-section is 
i e*(s? + u?) 
ee = AO 
C aa | su 


where Q3 is the angular element in three dimensions. Plugging in po + p4 = 2E2F3(1 — cos@), we find that 


u?+s* 1+ cos? 
su. — cas 


so the differential cross-section for no mass is 


do a?1+cos?6 


dQ As a 


On the other hand, if we had kept the mass term we would have found a similar result with additional corrections: 
Mi? = Be" fea Us fies y ane a ies). Yi ; 
Sat em: SS i= ee s—m °° u—m ; 
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We've now discussed a few different quantum field theories with different Lagrangians: looking first at the interaction- 
free case, we have the spin-0 LE!" = $(0,6)? — $m¢?, the spin-5 LP" = Pid — m)y, and the spin-1 LHe! = 


= FuvF YY — $(0,A“)?. We then calculated the two-point correlation functions for each of those cases, which are 
+m) 


i i(p- m 
p2—m?+i0' p2—m?+i0)’ 


igh’ 
p?+i0° 


represented with arrows in Feynman diagrams, to be and (Higher spins can arise, but in 
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the standard model we just have these three cases.) Introducing interactions means that we have to introduce new 
local operators: for quantum electrodynamics we add a —iewAw term to the Dirac and Maxwell Lagrangians, and to 
get the Higgs boson and t-leptons we add a — 2 hap term instead. 

We then computed the S-matrix using perturbation theory as a sum over amputated, connected Feynman diagrams, 
in which we associate particular pictures (vertices in quantum electrodynamics, for example) with a correspponding 
mathematical expression, which allows us to write down expressions for probabilities of particular scattering processes. 

Everything so far has been classical — our next step is to figure out quantum corrections, which come up when we 
expand further in perturbation theory. (Remember that there have been factors of fi in our expansions of time-ordered 
series, and taking f — 0 is recovering the classical limit.) So we'll be discussing renormalization throughout the rest 


of this course. We'll be looking at our ¢* Lagrangian for simplicity 


> 9 


| 
ae = Le ar ii 


and the idea is that we want to look at Feynman diagrams with loops, which give us expressions that look like 


t Cn = —— (if we attach a loop of momentum k to a segment with endpoints p, — this is called a tadpole graph) 


i 
or {2a oe t me H10 (KEP =P mE (if we have incoming and outgoing segments of momentum p, P2 on both sides, 


and the momentum is k and k + p; + p2 on the two sides of our loop). And the point is that as k >> m, p1, po, the 


tadpole graph contribution approximately becomes f oh ~ i an f dQ? ~ |k|?, and similarly the other expression 
diverges as ak ~ log(k). So we get quadratic and logarithmic divergence respectively, and we'll now discuss how 
bad that really is for probability calculations. 

Suppose we have a Feynman diagram with n vertices, E external lines, / internal lines, and L loops. (So in the 
tadpole graph above, we have (n, £,/,L) = (1,2,1,1).) By counting graph degree, for the ¢* theory, we must have 
4n=E+2I. |f we then work in d spacetime dimensions, we now have f d¢k integrals, so the integration measure of 


any diagram goes as d- Ll (Since we get an f dt%k for each loop) 


Definition 31 


The degree of divergence of a diagram is D = d- L — 2/ (where we're basically counting powers of k). 


For example, we can check that D = 2 and D = 0 in the two cases above. It turns out we also have L = /—(n—1) 
because of the requirements of the momentum conservation at each vertex. And this is good, because it means we 
can write everything in terms of n and E (which are actually the values that are relevant in a physical situation where 


we are looking for scattering to occur): 


D=d of) Patiga4 
(S-1)E+ma—4) 


and in particular this is 4— E when d = 4, which is good because we do not make our diagrams worse as we add more 
and more vertices. (This doesn’t mean there will be no divergence if E is large — this is a superficial degree just looking 
at what happens to k, and it’s possible that we have subdiagrams that diverge and are only isolated to specific parts.) 

So we can speak of renormalizable quantum field theories (in which D is independent of n, for example if d = 4 
in our specific theory above), super renormalizable quantum field theories (in which D decreases with n, for example 
if d = 2), and non-renormalizable quantum field theories (where D is increases with n, for example if d = 6). On 
the other hand, gravity in d = 4 turns out to be non-renormalizable — it’s one of the reasons why we think we don’t 
have a quantum theory of gravity. 

What we want to talk about is what happens in QED. There, we have n vertices, pe external y photon lines, 


p; internal ys, Ee external electron lines, £; internal electron lines, and L loops. The same kind of graph-theoretic 
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considerations (remembering that at each vertex we have one photon line and two electron lines) leads us to n = pe+2p; 


and 2n = E. + 2E;. Then because the photon propagator goes essentially as ra and the electron one goes as £ ~ a 


we have the superficial degree of divergence 


d-4 d-1l d-—2 
D=d-L—E;—2pj=d+n- 9 9 Ee 9 Pe, 


which is 4— 3E. — pe for d = 4. (So in particular, this means that QED is renormalizable.) The divergent diagrams are 
those where we basically have “electron self-energies,” meaning that we have internal photon lines branching off of a 
single electron propagator (so D = 4—2. 3 = 1), or where we have “photon self-energies,” where D = 4—2 = 2. We 
also have the case where we have “vertex corrections’ (for instance, taking a usual interaction vertex and connecting 
the two electron lines with a photon), where D = 4— 3 -2—1=0 (logarithmic). And of course, these are all divergent 
if they are embedded into a larger Feynman diagram as well. There are other diagrams as well, but they will cancel 
out when we implement them into larger diagrams. 

So it’s just the three types above that we need to worry about, and our goal now is to do something to deal with 
them. We'll do this using something called regularization — one method Is to perform cutoff regularization, where if 
we have some loop integration ihe de we can define that to be liM~soo fo afl and rearrange all the diagrams so that 
the A-dependence actually goes away. But the common technique these days is to use dimensional regularization 


and say that we're working in d = 4 — 2¢€ dimensions, taking ¢ — 0. 


Example 32 


d¢k i 
2m)? k?—m?+i0° 


With this method, our tadpole graph contribution is now 2 i ( 


In order to turn this into a Euclidean integral, we do a Wick rotation where ko = ikg, so that our poles in the 
k® plane are now rotated by 90 degrees. Then k? = (k°)? — k? is now —k2 (so that we don’t have the complicated 


Minkowski metric with a minus sign). Our integral then becomes 


Dy | dt ke 1 
2 J (2n)4 —k2 — m2 + i0' 


introducing spherical coordinates (this is the equivalent of d?x = r?drdQ. in three dimensions) this becomes 


IX i 1 
= —— d|ke||k, #1 fan ———————.. 
oi). [kell ke 42 — m2 + i0 


To compute this, we will use the useful identity 


© dlkel2 (k2 d/2-1 d d 
i kel? (ke) é) =(Ae (1 7 (2 =), 
4 2 ea 2 2 


where [ is the usual gamma function (related to the factorial) with F(n+1) = n! and F(d+1) = dI(d) for all d. This 


function turns out to help us with calculating the area of a sphere in d dimensions — remembering that /7 = [ dxe-*, 


raising both sides to the dth power tells us that 


ml? = f dicen Ein = f a | d|x||x|¢-te- FP 
0 


where again we've split into spherical coordinates. But this last expression is 


dM)? pias td ani 1_/d 
dou f xP ye ee = fan —T(—], 
/ af 5 (1x1) dol \ 
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_ Vm)? 
(3) 


—jirntl? og d d 
contribution he th a ST (2S = |, 
epee 2(2m)T (8) ( 5) ( 5) 


so we find that | Qg . So plugging everything back in, we find that the contribution from our tadpole graph 


iS 
and now if we plug in A = m? (as it is in our particular integral) and d = 4 — 2e we get 


ir m\~* ,f(1—e) 
= ~xae (E) ee 


We can now use the series expansion of Tas 


1 
r(l+e)=e % 14 Se? See tee: 
2 8 
where Ye is the Euler-Mascheroni constant (around 0.577) and ¢, is in terms of the Riemann zeta function: Cy, = 


yr ko”. So at leading order we see that 
im 1 
2(4m)? €' 
and thus the divergence shows up as a pole in 2. So we can now calculate in this fractional dimension because we've 
managed to make the divergence manifest. (And also, in dimensional regularization, we'll set [ dtkzs = 0 because 
there’s no “relevant scale.”) So the idea of renormalization in general is the following: we start with a Lagrangian 
L= $ (On)? — smo? = Oe where we can say that A can be the Ap in experiments times a factor Z,, and the fiedl 
@ can be op from experiments times some factor ,/Z%, and m? is then HZ i The point is that these Zs will encode 
the divergences, and then redefining our Lagrangian in terms of the renormalized constants makes the singularities 


go away. So we now have 


1 2 1 942 or 44 

L= 5 (Our) Z6 ZmZo5 ROR 2267; Rs 
and the correlation functions in vacuum (in momentum space) now become Pome instead of FORD: We can 
now do a perturbative expansion in terms of our coupling constant: say Z; = 1+ ARO? + 2.67) +--+ for each /. 


Then if we want to calculate all possible interaction terms occurring from a single line segment, we get the tree-level 


two-point function, plus the one-loop correlation function, plus order 2 terms, which now looks like 


iz | iz ( a iz) L002) 


p?—m2Zm+i0 p?—m2Zm+ id \ 2(4m)? \ 4a € p? — m2Zm+ i0 
This can be broken up in terms of our perturbative factors into 


a | pcs 5) 16? m2, iARM? me im r(1+e) I ; O(22) 
pe—me+io\ 9 Oe © po me +10 24m)? \ ae — p2—m2+i0- 


P 2 =—€ 
and now if we choose 5 =0, 6) = oer (32) ruts) | then we find that the renormalized two-point correlation 


function is the renormalized tree-level diagram plus O(A2), which is just F . So we've removed the tadpole 


i 
2—me+i0 
diagram contribution from our calculations entirely by choosing the appropriate 6 coefficients. And in all calculations, 
we can use this kind of redefinition universally — we'll get the same 6 because the same singularity appearing in the 
two-point function can appear in more complicated diagrams as well, and it turns out we will only ever have a finite 


number of such problems that we need to remove. We'll take a more systematic approach to all of this next time! 
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17 December 1, 2022 


Last time, we started discussing divergences and renormalization — we saw that certain diagrams with loops give us 
k-dependence like k? or logk. To characterize the degree of divergence (at least at a superficial level), we define for 
any diagram a number depending on the spacetime dimension, number of vertices, and the number of external legs. 
For example for the ¢* theory, we found that Dy =d (g 1) E+n(d—4) = 4-6, so at least the theory is 
normalizable (D is independent of n). Similarly for QED, we found that Deep = 4— 3E. — Pe. The way we deal 


with these divergences is with dimensional regularization, where we consider our spacetime dimension as d = 4 — 2€ 
and take € — 0. Then the renormalization corresponds to redefining couplings, masses, and fields with power series, 
perturbatively expanding with coefficients 6), which will be “counter terms” to absorb the singularities that we see in 
our problems. (Indeed, we saw last time that choosing a specific 5) and 6) allow us to avoid the tadpole diagram in 
our renormalized two-point correlation function.) And the constants like Ag in which we are doing perturbation theory 
can be actually physically measured by looking at scattering cross-sections and comparing to real experiments. But in 
most quantum field theories we cannot get exact formulas for these expansions (calculations usually use around 5 or 
6 loops for things like QED or the ¢* theory). 

Today, we'll think about QED in d spacetime dimensions. We have S = foo and S should be unitless, so £ 


should have mass dimension [L] = d. Writing down the Lagrangian 


— 1 = 1 
L= pid — mp - qi Fe — epAy — 5(OuAr)*, 


we know that [m] = 1 (that’s how units are defined) so [p] = ae and [0,] = 1s0 [A,] = g — 1; plugging into 


the eWA term shows that we must have [e] = 2— $. Thus, we will replace e with e(ur)?~%/* so that we have a 
dimensionless coupling constant, and fze will carry our mass dimension. And the mass dimension ends up telling us 


how badly operators diverge if we put them in a loop (higher means worse). 


Remark 33. For Dirac algebras in dimension d, we then see that n“” has entries (—1,1,--- ,1) on the diagonal with 
7’, = d. We'll still have the Clifford algebra identities {y"*,y’} = 2n”, yey, = d, yey’y, = (2 —d)y’, but 
importantly the trace of the unit element tr(1) is still 4. So we're not changing the dimensions of our y matrices 


themselves, and that's bad because we don't have a good ys in general. But we won't go into that much here. 


We'll thus redefine our parameters via 


Zi d/2-2 
e ' 
Z Zz RL R 


where Z) = 1+ ae 50) +--+ are again defined as perturbative expansions. Our QED Lagrangian is then 


p=VZovrR, Av=VZ3AR, m=m,+dme= 


_ 1 » a = 
Loev = Pri(d — Mr)vr - a ruRR —- 5 (OuAR) — erPpArvr 


1 , 6 nee a 
~— 753FurFR — 5 (uA)? + Pp(iPd2 — 6m — mRbo)Wr — UeerWpAnWr 


(everything in the second line is extra terms from renormalization) and we can now adjust the counter terms to get rid 
of the divergent diagrams. The two-point correlation function [ d*x (O|T {h(x)p(0)}|0) contains a contribution from 
just a straight directed line segment, then at order a we have connecting two points on that segment with a photon 
line, at order a? two different contributions from two photon lines coming off of that segment, and so on. We'll define 
a concept here: we are 1PI (one-particle irreducible) if whenever we cut one line, the diagram does not fall apart. 


(So one of the two a? contributions is 1PI but the other is not.) Then let the series consisting of all 1P! diagrams be 
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& (they basically consist of diagrams where all photon lines are concentrically nested) — our correlation function then 


consists of just the fermion propagator term with no additional lines, em = = i(p m)~*, and then the 
diagrams from all 1Pl contributions, then the diagrams where we have two disjoint blobs of 1PI contributions, and so 
on: thus the contributions to our integral are 
/ i / , / i i _ / 
pom pote 


yr yr y ee 
Pan Pom” Pn Pon Pom 
by doing a geometric series. We now need to set up our renormalization schemes (conditions): we want, at all orders, 


; 


i dt xelPx (0|T (P(x) (0)}]0) | pone = jm, 


We thus need (plugging in renormalized quantities on the left-hand side) 


ee - j _ i(1 + 62) 
p—m—dM+10| 4m — p—m—iX(p2 — m2) — (p— m) Flee t P m)? (me) (1-1), ) 


p=m 


so the two conditions we must have for singularities to go away are 
dm=iX(p?=m’*), 6. =-i — 


at all orders. (This is called the on-shell scheme.) So we'll now go through the self-energy corrections and see the 


calculations more explicitly: 


Example 34 


Consider the diagram with a single photon line loop, where the momentum is p — k on along the photon line. 


Then the contributions within the loop are part of 2, and collecting terms gives us 
i a i(K+m), . —in 
yO) = in v yy 
(om) rey yaa IY Waa 
( 


(207) (k2—m?)(p—k)?2 (2m) k2 — m?)(p — k)?' 


where we've used the y matrix identities. We'll now introduce Feynman parameters, where the trick is basically that 


‘l. co 1 co co co co co 
—— | dxe A => —_ = | i dx, dx e148 — | | i dydx,dx26(y — x, — x2)e 14-28 
A 0 AB o Jo o Jo Jo 


and now rescaling x; and x2 by y times yields 


= dydx,dxoy6(1 — x, — xo)e MY ary dx, dx» —_____—~.. 
| | | y dx; dxoy6( 1 2) ‘ 1X2 (Ax, + Bxo)? 


Plugging back in, we see that the integral we want to evaluate is 


@ d 
= ony (k2 — ae Zap? = mye | adel — X, — x2) [(k? m2) xy + (k2 — 2kp 4 p2)x] 


h 


since x2 = 1 — x, the bracketed term is k* — 2kp(1 — x1) — m?x; + p?(1 — x1). Doing a linear shift k + k+ p(1— 1), 


we thus have 
2 


d 
= | pa | enarott XI x2) [k? | x1((1 x1) p? m?)| 
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Then performing a Wick rotation like before, this simplifies to 


iL 2 
- | aba [| SEE)? doL-Ke — aY, 


where A = —x;((1—x1)p?—m?), and now we can just calculate this integral because we've converted to the Euclidean 

metric: we have Q4 T(d/2)F(2— 4/2) 

I\dq = 2 2 d/2—2 

= dx1(— Li = : 
2(2n)4 F(2) ; x1(—(p*( x1) — m)x1) 


The point was that we have something quadratic in k? — after some manipulation there's no mixed term kp and thus 


h 


we can turn our problem into a spherical integral, using that f dxx?(x + A)? = Alta—b Uta) 1 (6). We only really 
care about the counter terms here, though — expanding the result around p? = m? (because that’s where our pole is), 
we have 


: d-4,d-4 p> — m? d—4.d—5 2 2\2 
| dx (xf "mo + (1 = x1) mx + O((p* — m*)*)), 


and so we have 


(i+ Ne =2E i 
"= G any a E (p° — m) =m + O(p” m?)?)). 


But now we also have to deal with the fact that there is a K in the denominator, and we do so by defining 


d¢k kK 
ee i) (2m)? (K2 — mP)(p— kp 


doing exactly the same thing yields [ a ie dx, [K(k? — 2kp(1 — x1) — mex, + p2(1 x)], which (after the shift 
k++ k+ p(1—x,)) becomes 


d Ae 
| amy ff K+ 9-0) [tO 9) mY, 


and in fact the contribution from the only remaining & in this expression will now cancel out because the parts with k 
and —k do. Thus 


ae iT(1 +e) ame |. pe im 
Io 2(1—e)/ (1 — 2e)e(47)? (=) 1 anes 


We then also find that 


iar ( m? \ * 3—2€ 
Y|e=n2 = — r(1 =/ 
lp=me Ane (3) Pee) 1 —2¢ (00) 


by definition, and thus in order to satisfy the conditions we must have 


d=| dm 


62 = -i—- 
dp pom m 


So the point is that in the process of renormalization, we introduce a finite number of parameters that redefine 
couplings, masses, and fields, and in regularization we set the number of dimensions to 4 — 2€. We then want to get 
rid of the counter terms. The on-shell scheme then shows that the fermion propagator (vacuum energy of the electron) 

i igh 
ae PO 
is given by —ieF(q? — 0, m?)ay°ud + 6(q?), where ¢ is the electric potential. And in particular, F(0,m*) = 1, so 


iS to all orders in peturbation theory, the photon propagator is — in all orders, and a vertex (as q* > 0) 
we're not changing the electric charge to all orders in perturbation theory — the point is that E # Ep but Fp is just 
constantly 1. That makes £ = Lp + Leounter, and by defining our appropriate constants makes the Lagrangian look 


the same as we're physically used to — we cancel out the singularities in all diagrams that we can construct. 
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18 December 6, 2022 


We'll start today by summarizing the ideas of renormalization once again. Basically, when we have divergent diagrams 
(such as loops in a photon propagator), we characterize how badly they diverge using a superficial degree of divergence 
D (which was 4 — 3E. — Pz for QED, where Ee, Pe are the number of external e~ /e* lines and ¥ lines, respectively). 
We then regulate this divergence by setting dimension to d = 4 — 2¢ and take € > 0 in a controlled way — we can 
then redefine (renormalize) parameters and fields in our theory to cancel out the counter-terms. Specifically, in QED 
we set p = /Zoppr, AM = /Z3A%, m= me tm, and e = s“+-pSer, where each Z; = 1+ 6; can be expanded 


Zo/ 23 
in perturbation theory. The point is that these corrections from 6s only come into play at higher order, and we found 


the renormalization conditions for QED last time (demanding that all potential loop contributions from the electron 


propagator at p? = m>, should yield Z and that all potential contributoins from the photon propagator at 


i 
2 Pan] 
mpt+i0 


p* = 0 together are =a and a third condition coming from the vertices) — we then ended up with the equations 
a 1 ; O (1) dé 
om! ) = iL| po, ~o3) = ~i apleane 


Then because the self-energy of electrons, photons, and vertices are the only situations in this theory with a superficial 
degree of divergence, every potential problem diagram will have one of them as a subdiagram. So this renormalizes the 
whole theory properly. (We should remember that the observable actual values of our parameters are what correspond 
to physical values — the original ones are just badly divergent and incorrect.) 

The problem is that there are other issues with quantum field theory that we've ignored and only hinted at so far in 
this course. We've dealt with ultraviolet singularities (loop momenta going to oo) using renormalization, but we can 
encounter infrared (also called collinear) singularities at low energies as well. For example, introducing a connecting 


photon line of momentum & near a vertex by connecting the two electron lines give us [ d*k and when 


1 
k?(k+p1)?(k—p2)*" 
pe = pe = 0 and k* ~ 0 (remember these are still four-vectors) our integral is basically 


fo 1 = f Mee d2Qx se dQ4 
(k?(2kp1)(—2kpo) 2 k?(2kp1)(—2kp2) E? (1 —cos@1)(1 — cos x2) 


because kp & —E,E,(1—cos xp). So if we take Ex — 0 (so very low energy and very high wavelength, corresponding 


to photon interactions across huge distances), or if that photon line is almost collinear with p; or p2, connecting the 
diagram also diverges and this is a different issue from that in renormalization (in which we imagine that the photon 
line is getting very close to the vertex). 


The solution is basically to not just add loop diagrams but also define our observables accordingly. A scattering 


process e& +e7 > yY— w+ with an additional y line connecting 4 and ~~ must also include corrections where an 
actual photon is emitted — these might look completely different, but in the limit where the photon energy goes to zero 
the processes will look experimentally identical, and there is a singularity that corresponds to the infrared singularity 
in the original diagram. The same is true of collinearity (since detector can not resolve the difference between the 
muon and photon, and adding the quantum numbers gives us back the original muon). So the point is to think about 
the cross-section a as a sum of the real diagrams and the virtual diagrams (we need to make sure the “infinitely-long 
distances” don’t end up contributing to our physical result), and if we want to learn more about this we should search 
up the KLN theorem. 

What's nice, though, is that dimensional regularization can regularize both ultrviolet and infrared singularities. 
In the previous lecture, we looked at the on-shell scheme, and when we subtract off only the =o singularities from 


ultraviolet that’s instead called the minimal subtraction scheme (only removing poles). We may also use the modified 


minimal subtraction scheme (MS) in which we add some constants such as modifying the coupling constant e = 
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e/2 
Zoe () e®YE ep(u2) to make our expressions nicer. (And this is in fact most widely used in calculations.) 
this MS scheme, we have 
ss 3a wies a 
Ss : R S R 
OM = ilpamuv =—Za A = —F (0, uv) = — Ze 
ws dh QR MS QR 
(Oe py, SS Se, 
2 I gplPamuy Lire 3 2(0) sae 


(This last expression is the total contributions from the photon self-propagator at p* = 0 in the ultraviolet correction.) 
The point is that these expressions look much neater than before, since there are no finite terms and we just have 2 
type terms. And this is valid at order ag — it turns out we need to calculate infrared finite observables to figure out 
how to deal with these cross-terms to remove the remaining singularities and figure out the counter terms. 

If we translate between different renormalization schemes, we can relate the parameters between them. For 


example, we have that 


MS 2 908 Os MS Os ae 3 mp OS)\2 
mp mp + 6m>> — 6m™? = mp? {1 e 14 q [09 58 + O((aR”)*) 
R 


(here OS means “on-shell’”). And groups like the Particle Data Group always define real physical quantities relative 
to specific renormalization schemes — for different purposes, different schemes may be more useful, and because our 
series in renormalization aren't completely divergent some schemes may do worse than others in that sense. 


We can now talk about the running coupling — the fine-structure constant a = = becomes the renormalized a 
in the MS scheme: _ 
MS 2\€ 
a= aR” LR ef VE 
23 An 


(here notice that ag must depend on 42) because in this case it turns out that Z; = Z) and = =1 +2 ae ey O(a2). 


But now the left-hand side is independent of u2, so if we apply Lie aut to both sides we get 0: = B(ar(u%)) |, where 


B is the beta function of our theory (telling us how th ecoupling constant changes in terms ofour scale ir). 
2 QR 3 
B(uR) = ax [60% +B, (*) + (=) 4 | 


with (matching coefficients with the form of =) Bo = —}. So that means that 


O ows am (Ue) 
Mraz oR (HR) = oe R" + O(a). 


We can then try to solve this renormalization group equation 


dar Bo > dar Bo 
— => z= eo 
dlog(u2) a. az, og(uR), 
and integrating both sides yields 

Bo, UR 1 1 2 ar(HG) 

log = Ar(Up) = . 
mn” Ue are)  aer(UG) Mh) 1+ 2804) 8) log He 

0 


So comparing with the on-shell scheme, we find that 


MS, 2 os 1 — ays 
AR (WR) = as Ta gOs = OR” 1+ 2 Jog HE 
3 me 


so that aS (me) is actually just the ordinary fine-structure constant ~ a in the on-shell scheme. So ap starts off 
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137 ~~ Boor(Me) 
1.2 x 10?’8GeV, which is ridiculously high), or we can also think about 12, as decaying proportional to R? and 


at - at me- and goes to oo as U2 increases (in fact there is a Landau singularity at Age = Me exp ( atm) ~ 


approaching is as R — oo. So QED becomes stronger and stronger coupled depending on our renormalization, and 
the modifications come through this kind of calculation. But the pole means that at some point we cannot think about 
particle degrees of freedom anymore — each order in perturbation then starts to contribute a similar amount and the 


probability of any number of particles interacting is essentially the same. 


Fact 35 

In QED, because G < 0, the Landau pole occurs in the UV regime. Meanwhile, some theories have G = 0 — 
they are called conformal theories, where the size of interaction doesn't change and there Is no intrinsic scale 
coming from quantum corrections. Finally, there are some systems with G > 0 where the Landau pole occurs in 
the IR regime, such as quantum chromodynamics. (This means that as the energy gets lower and lower, we need 


more and more diagrams to build into our amplitudes. Indeed, in QCD we work with quarks and gluons, and there 


we only have protons and neutrons and cannot talk about individual degrees of freedom for quarks and gluons.) 


There are some two-dimensional quantum field theories similar to the Ising model where we can see the transition 


between different coupling regimes, but this is still limited to toy examples. 


19 December 8, 2022 


Last lecture, we discussed the renormalization of QED in more detail, highlighting the differences between UV (short 
distance) singularities, which come from just a finite set of diagrams, and infrared (long distance) singularities. Specif- 


ically, we thought about different renormalization schemes beyond the on-shell scheme such as MS (the modified 


minimal subtraction scheme), in which our renormalization only absorbs the Z poles coming from dimensional regu- 
larization d = 4 — 2e, as well as some constants that make our calculations cleaner. And it’s important to remember 
that physical parameters like masses are dependent on the scheme that we use. In particular, there is no “actual mass” 
— in actual experiments we do measure the invariant masses by looking at a decay process and looking at the sum of 
the resulting momenta, plotting cross-section o as a function of that. Then there will be a peak at the top quark 
mass, and we could call that the preferred “mass,” but it’s still fundamentally a parameter in the Lagrangian. 

Today, we'll look at LSZ (Lehmann-Symanzik-Zimmermann) reduction. In the course of our renormalization 
procedures, we calculated the two-point correlation function (or specifically its Fourier transform): in scalar field theory 
this looked like [ d*xe!™ (Q|T{(x)G(0)}|Q) |,2-m2, and we found that this looks like a ar This pole at p? = m? 
was then interpreted as a particle having a well-defined mass m, which is an isolated particle at a particular location. 


If we then consider n points and calculate an expression like 
J= [ dxe™ (QUT {O(x) (21) +++ b(Zn) HQ), 


we now wish to identify the poles in p° associated with on-shell particles. To do this, we can split our time integration 
range into three pieces, writing [°° dx° = (- dx°+ fT, dx° + 7 dx®, calling those three regions region I, Il, and III. 
(Specifically, choose T so that its magnitude is much larger than that of any z®, so region | and region III correspond 


to times “much earlier” or “much later” than the significant contributions to the correlation functions.) In region Ill, 
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where x° >> Zz we can pull x out of the time-ordered operator (because it’s “happening much later’) 
Jin = fo dx? f dre (216COT Le) O20) 10) 
3 2 
= [are f xe [8 oer lb~Ila) (alT Ela) 6eMHIO) 
where we've inserted a complete set of states. We then have 
(Q[d(x)]a) = (Qle"”*G(O)e“'™*]q) = (Q16(0)| a) eo =e(a) 
where E = ,/gq2 + m2, so we can rewrite 
= el(p?—a")x° 3) al(B-G)-X _— 
sun = fo axe f SE rela f gPxel™O% (2I6(0)0) (TE): 620} 


and the point is that the only x-dependence is a delta-function [ d?xe'(?-9* = (2m)°6()(p — G), which we can then 


use to also calculate the d?q momentum integral. This thus all simplifies to 


lore) If p70 _ 7 0 
= [Pe FAP (a p(0)10. E(@)) (BERIT (C21) -- 8(2n)}12). 


where we're writing the momentum eigenstate in a funny way because we haven't specified p° yet. We can then carry 
out the last x° integration and end up with 
jel(p?-E(e)T 


2E(p)(p° — E(p) + 10) 


Ji = (Q|6(0)|6, E(P)) (6, E(P)IT {b(21) --- b(Zn) FQ) - 


So if there is a pole at p? = E(p), then the contribution to the correlation function as T —> oo for p® + E({) is of 


J NE (IT OCe) 8H), 


the form 


. (So the point is that poles propagate a 


where we've used that (9\$(0)|p) = VZ, seaaprteqte = 


single particle, and we're saying that we generate a one-particle state at aes future times if we take the Fourier 


1 
p?—m2+i0 


transform and evaluate it near an on-shell pole.) 


Next, we can look at region |, in which 


-T 
n= [ dx° / Pxe!™ (Q\T{H(Z1) + (Zn) }6()|2) 


0 0 


because now we're in a region where x* is much earlier than the zs. Again inserting a complete set of states and 


doing the same calculations as before, we see that 


iVZ 
lim J= lim ———>—— (Q\T{@(z,)--- 6(z,)}|—p), 
pi>-E(6) — p-+-E(p) P? — MRF jo SATA) O20) FP) 


so if we are forcing p° to be the negative energy (in this definition of the Fourier transform of the momentum p), 
we're picking up a particle pole sitting in the initial state with momenutm —p. And now we can repeat this for all n 


of our particles, and we find that 


i [loco Tray (QT {O04) += BOA) ++ BO%n) OO) ++ HUE) + GUY) FO) 


ee 9B) 


that is, if we send the final-state momenta to what we want them to be, we get single-particle-state energies. And 
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specifically sending the final and initial pe and pe to the corresponding energies E¢(pr) and E;(p;), we get 


Il iVZ Il i/Z 


= hlgiesae: HAT LIM peace 
pz — m2 + i0 caer rer rlT {1} IP 1 Pr) 


i=1 i=1 
and thus we can take an n-point correlation function (which is what we learned how to calculate in the first half of the 
course, at least for scalar fields), doing a Fourier transform, demanding that the momenta with respect to our on-shell 
one-particle states can be related to some propagator factors times an S-matrix element. And in fact we can turn 


this around and use it to define an S-matrix element: 
n : —1 m . a1 
iIVZ iVvZ 
ky,--- , kel T{1 titty = lim ———— ee 
(ky F/T {1} |p1 Pn) pace! semi (eer (zea 


J TL etxe® TL are (217 £802) (pr) 6(ka) ++ BC) }12) 
i=1 f=1 


Graphically, we can think of the 5 iVZ 


2—m?,+i0 


for all potential loops and other interactions inside), one for each of the n incoming particles and m outgoing particles, 


terms as the inverses of diagrams coming from a single propagator (accounting 


and the final term coming from sending in n particles and getting m particles out. So basically what we want to count 
is only the amputated (n+ m)-momentum-space diagrams for the S-matrix element. So the key is to take the n-point 
correlation function that we know how to derive, Fourier transform it with momenta corresponding to on-shell particle 
momenta, and then if we're in a situation with an isolated n-particle state we must pick up the right residue for the 
pole. (But if we think about this situation for QED, the photon is massless and everything becomes messy.) 

There’s now one more thing we need to clear up, the Ward-Takahashi identity (this is a totally new topic). 
Suppose we have a scattering of a bunch of particles including a photon, so that we can write the scattering matrix 
element M = e#M,. Then based on gauge invariance, we find that k¥M,, if k is corresponding to the photon 
momentum é€(k), and we want to understand where this comes from in generality for QFT. To do this, start with a 
fermion line and suppose we have n photon lines going into it of momentum gq, scattering in and “adding momentum.” 
So if the initial fermion momentum is p, we get the momentum py = p+ qi, Po = P1 t+ qo,--: , P' = Pn-1 + pn at 
various points on the line. Now suppose we add another photon of momentum k in between gq; and qgj+41 on the line, 
so that p; becomes p; + k and our final momentum is p’ + k. We then want to replace the polarization vector k# with 


its momentum e#. The insertion of a vertex can then be written as —/e((pj + K + m) — (p; — m)), So if we just look 


at the new vertex inserted we get 


param g m= 8 (Gm geben): 


For the general fermion line in which the diagram continues, we can then write the contribution as 


ie ie ) i 


tae aa ey ee im 


Meanwhile, if we insert into the neighboring location between qj-1 and qj instead, we end up with 


/ / ie le 
Ie 4d yt hi ( ) yyhi-t San 
Piri tkK-—m pitK-—m pi-1—m piitKkK-—m 
We can then compare the two diagrams and notice that there will be lots of cancellations if we add them together by 
telescoping. Adding across all possible insertion points, we get e times the original fermion line without any insertion, 
minus e times the fermion line with shifted momenta p+ k everywhere in place of p. And indeed if we think about our 


photon as part of scattering, it will interact with our scattering amplitude in some way, and that’s exactly summing 
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over all insertion points. We thus find that if we have a scattering process with n photon points, we have a loop 


amplitude (where momentum starts off as p; and everything else is conserved by momentum conservation at vertices) 


k*M,, = —e( iey? f oe (« E u <n 5 i = if E o — opt. ; a — |) . 


where one trace corresponds to momentums shifted and one not. But then by shift-invariance of the loop momentum, 


we can replace p; +> p; — k In one of the two traces, so we indeed have k*M,, = 0 for loops. However, if we have an 
external fermion coming into our amplitude, we shouldn't write the amplitude with propagators because we should be 


amputating our diagrams — we should write 


MY = = U(p' + k)(—i(p' + k — m))- (insertion diagram with photon k inserted) - (—/(p — m))u(p) 


insertion points 


where u is some spinor. Then the graphical identity we just derived simplifies this to 
= eUi(p'+k)(—i(p’+kK—m))-[(original insertion diagram) — (original insertion diagram with shifted momenta] (—/(~—m))u(p). 
(where “original insertion diagram” means we don’t include the photon k). Then we can use the Dirac equation 

(p — m)u(p) = U(p? + k)(~p+kK-—m=0 


and indeed find that k*M,, = 0 in this case as well even if we don't have a loop. So when we did calculations with 


Compton scattering, for example, we calculated square matrix elements and had to substitute for polarization sum 


and brute-force replaced eae eh) (k)€Q)(k) = —7”. But we should really be summing over physical polarizations 
0 0 
(only transverse in the direction of motion), so that wasn’t really correct. (If we have E4) = and E(>) a i we 
0 0 
0 0 0 0 k 
. _ {O 1 0 0 : : ; : O} . 
instead get the matrix eae .) So what's happening here is that if we have some k# = . with €(,)k = 0, 
0 0 0 0 k 


we can Introduce an auxiliary gauge vector n” that also satisfies n-€ = 0 and is independent of k. We can then create 


a polarization sum that is actually useful to us: consider 


1 00 0 
ken’ + k¥ nb kek¥ 10 0 0 0 
n-k (n-k)? 10 0 0 O 
000 -1 
and then we instead find that 
2 
ken’ + k¥ nb ke KY 
HO) (k)e’ (k) = BY } : 
dé ( JE) ( ) n n-k (n- k)? 


But when we calculate squared matrix elements like for Compton scattering, we're basically using expressions like 
MEP Mt, and by the Ward identity we just derived, we really only need the metric term because k*M,, = 0. So we 


indeed don't need to deal with complicated additional terms because of this graphical proof! 
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